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Abstract 



This thesis addresses the problem of developing a quantum counter-part of the well estab- 
lished classical theory of control. We dwell on the fundamental fact that quantum states are 
generally not perfectly distinguishable, and quantum measurements typically introduce noise 
in the system being measured. Because of these, it is generally not clear whether the central 
concept of the classical control theory — that of observing the system and then applying 
feedback — is always useful in the quantum setting. 

We center our investigations around the problem of transforming the state of a quantum 
system into a given target state, when the system can be prepared in different ways, and the 
target state depends on the choice of preparation. We call this the quantum tracking problem 
and show how it can be formulated as an optimization problem that can be approached both 
numerically and analytically. This problem provides a simple route to the characterization 
of the quantum trade-off between information gain and disturbance, and is seen to have 
several applications in quantum information. 

In order to characterize the optimality of our tracking procedures, some figure-of-merit 
has to be specified. Naturally, distance measures for quantum states are the ideal candidates 
for this purpose. We investigated several possibilities, and found that there is usually a 
compromise between physically motivated and mathematically tractable measures. We also 
introduce an alternative to the Uhlmann-Jozsa fidelity for mixed quantum states, which 
besides reproducing a number of properties of the standard fidelity, is especially attractive 
because it is simpler to compute. 

We employ some ideas of convex analysis to construct optimal control schemes ana- 
lytically. In particular, we obtain analytic forms of optimal controllers for stabilizing and 
tracking any pair of states of a single-qubit. In the case of stabilization, we find that feedback 
control is always useful, but because of the trade-off between information gain and distur- 
bance, somewhat different from the type of feedback performed in classical systems. In the 
case of tracking, we find that feedback is not always useful, meaning that depending on the 
choice of states one wants to achieve, it may be better not to introduce any noise by the 
application of quantum measurements. We also demonstrate that our optimal controllers are 
immediately applicable in several quantum information applications such as state-dependent 
cloning, purification, stabilization, and discrimination. In all of these cases, we were able to 
recover and extend previously known optimal strategies and performances. 

Finally we show how optimal single-step control schemes can be concatenated to provide 
multi-step strategies that usually over-perform optimal control protocols based on a single 
interaction between the controller and the system. 
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Introduction 



Broadly speaking, to control a physical system is to modify its natural evolution towards 
some preferred dynamics. An obvious example is steering a car along a sinuous highway. An 
attentive driver rotates the steering wheel in order to preserve the car on the highway - 
obviously preferable than the natural straight line trajectory. Even in this simple example, 
three pillars of Control Theory can already be recognized: 

1. Characterization of the possible ways of influencing the dynamics of the system (in the 
example, a rotation of the steering wheel); 

2. A clear description of the control goal (in the example, to preserve the car on the 
highway) ; 

3. A method to determine a control action that, if does not precisely achieve it, at least 
approximates the goal (in the example this is implemented in the driver's brain, which 
induces a suitable movement of the arms based on the observation of the highway). 

Feedback control, a central concept of this theory, appears in the third point of our 
example: the direction taken by the car is conditioned on the observation of the highway. 
In fact, a driver who is asked to close their eyes and change lanes usually drives the car off 
the road at an angle. This very same idea of conditioning an action on the outcome of a 
measurement has found many technological applications in aircraft flight control, fabrication 
of fiber optics cables, robotics and many others. 

Behind all these triumphs of classical feedback is the fact that — at least in principle - 
there is no cost associated to the extraction of information from classical systems. In contrast, 
quantum measurements cannot perfectly distinguish all states and necessarily disturb the 
measured system. In practice, the information gained from a quantum measurement may 
not compensate for the disturbance caused by it. As a result, the characterization of an 
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unknown quantum system is fundamentally more difficult, rendering the role of feedback in 
the quantum domain disputable. 

In this thesis we study how measurements on finite dimensional quantum systems can be 
designed in order to optimize this intrinsic trade-off between information gain and distur- 
bance, aiming at the design of optimal feedback schemes for quantum control. As demon- 
strated by Fuchs and Peres |J, there exists an entire range of generalized measurements that 
trade-off information gain and disturbance, and as recognized by Doherty, Jacobs and Fuchs 
H [7J, it is the basic problem of quantum control to choose one such that feedback offers 
more help than hinderance. 

The theory of quantum feedback control started to be developed in the 70 's with the work 
of Belavkin in the mathematical physics literature []8], However, due to the experimen- 
tal limitations of that time, the field developed almost exclusively along a mathematically 
abstract direction, until reappearing in the 90's in the quantum optics literature [TD], |TTj], 
encountering much richer experimental possibilities and practical problems where the theory 
could be successfully applied [|T^, |TJ, [TJ], [TS|, |TS, |T7| . 



Here, we introduce a toy feedback control problem that brings new perspectives into the 
study of the trade-off between information gain and disturbance. We consider the quantum 
tracking task of transforming the state of a quantum system into a given target state, when 
the system can be prepared in different ways, and the target state depends on the choice of 
preparation. It is not difficult to see that tracking can be formulated as a transformation 



between two sequences of density matrices, as illustrated in Fig. |TTT| . The source sequence 
models the uncertainty of the initial preparation; each density matrix has some probability 
of being the actual state of the system. The target sequence, in turn, is formed by those 
states that we would like to output for each initial preparation. Ideally, we would like to be 
able to perfectly map one sequence into the other, no matter what the initial preparation 
was. Since this is usually impossible, we look for optimal approximations of these sequences. 
A choice of measurement and feedback that minimizes some notion of distance between these 
sequences is optimizing some trade-off between information gain and disturbance || ]6|, [7]. 

Source Target 
/ pi .N^sequence /, ■p 1 \^ sequence 

I : | liri : / 



Figure 1.1: Schematic of the basic tracking problem approached in the thesis. 



The approach adopted for the discussion of the tracking problem is based on the three 
pillars of the control theory enumerated before: (1) the theory of Quantum Operations (or 
Quantum Channels) provides a mathematically rigorous description of which operations are 
and are not possible from a physical perspective. (2) The task of enforcing a particular 
dynamic is formalized via an optimization problem (over the set of quantum channels) that 
attempts to minimize some distance measure between the output states and the target states. 
(3) Depending on certain technical details, the resulting mathematical problem can be cast 
as, or approximated by a Semidefinite Program (SDP) |Tj| - a special type of convex 
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optimization problem that, computationally, can be efficiently solved in polynomial time. 
The theories of quantum operations, distance measures and convex optimization form the 
key background material underlying the formulation and solution of the quantum control 
problems tackled in this thesis. 

Regarding applications, the problem of approximating sequences of density matrices 
turns out to be sufficiently general to encompass a number of interesting problems in quan- 
tum information science. Its optimal solution provides optimal schemes for tasks such as 
state-dependent quantum cloning, quantum error correction, quantum state purification and 
quantum state discrimination. Besides, it illustrates in a clear-cut way some key departures 
between the optimal quantum and classical control theories. 

In the remainder of this chapter we outline the organization of the thesis and the new 
contributions made to the field. Loosely, it can be divided in two parts: In Chapters |2| and |^, 
we review some background material useful for the formulation of the single-step tracking 
problem as an optimization problem. From Chapters [| to |7], we focus on solving this problem 
and a multi-step variation. A more detailed description of the content of each chapter is as 
follows. 

In Chapter ^| we review the relevant concepts of quantum operations and semidefinite 
programming. Although most of this chapter is devoted to well established results of these 
two fields, some new and more specific material is also introduced here. For example, some 
of the adopted notation is defined, non-standard representations for quantum operations 
are derived and, most importantly, the first steps towards the formulation of the single-step 
tracking problem as a SDP are given in the last section. 

Chapter [| merges a review on standard distance measures for quantum states with the 
introduction of a new measure identified in the course of the present work Q. After a 
thorough evaluation of its properties, the latter is argued to be an easy-to-compute alterna- 
tive definition of the Uhlmann-Jozsa fidelity between mixed states. In our quantum control 
framework, distance measures between quantum states characterize the objective function 
to be optimized by the controller. In this sense, the results of this chapter are relevant for 
the identification of figures-of-merit that are both physically motivated and mathematically 
tractable when used in the formulation of optimization problems for quantum control. 

Chapter f| brings together the results of the two preceding chapters by assembling a num- 
ber of optimization problems, each of which is associated with one of the distance measures 
from Chapter |^. The realizability of each problem as an SDP is discussed and their solu- 
tions are compared in an attempt to measure the sensitivity of optimal quantum control to 
different choices of figures-of-merit. 

In Chapter [| we present our first analytical results on the solution of a specific type 
of the single-step tracking. The source and target sequences are restricted to contain two 
pure states of a single qubit, in such a way that the tracking task reduces to the stabi- 
lization of an uncertain preparation of a single qubit against dephasing noise. Exploiting 
the symmetry of the states involved and a particular representation of the dephasing map, 
quantum and "classical" channels are analytically constructed and proved to be optimal for 
the stabilization task. Most results of this chapter were presented in Ref. 0. 

In Chapter || the single-step tracking problem for pairs of qubits is approached in its 
full generality and, nevertheless, analytically solved. Several applications of this solution 
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are provided in the grounds of quantum information science. In particular, reproduction 
and extension of optimal schemes for quantum state discrimination, state purification, state 
stabilization and state-dependent quantum cloning are obtained. Except for some minor 
adjustments, this chapter reproduces Ref. [||]. 

Chapter [7] presents some preliminary results in the formulation and solution of a multi- 
step control variant of the tracking problem. Here, we face the problem of designing multiple 
control interactions that attempt to track the (uncertain) input density matrices to some 
desired target. This problem is sufficiently difficult not to be approached from an analytical 
perspective and we describe a heuristic that, based on the application of the optimal single- 
step tracking solution, usually leads to multi-step control strategies that over-perform the 
optimal single-step control scheme. 

Chapter |8| summarizes our main results and presents some possibilities of future research. 



2 

Quantum Operations, Semidefinite Programs 

and Quantum Control 



2.1 Introduction 



This chapter covers some background material on quantum operations and semidefinite pro- 
gramming. For a long while, these topics were developed as independent research fields 
of quantum mechanics and optimization theory, respectively. Recently, Audernaert and de 
Moor [|19|] recognized that the problem of determining optimal quantum operations for a 
given task can sometimes be cast as a semidefinite program. Ever since, connections be- 
tween these two topics have been further developed and explored in quantum information 
science. This thesis is one result of this symbiosis. 

The selection of topics reviewed in this chapter aims to achieve the competing purposes of 
making the thesis as objective and self-contained as possible: only the essential concepts and 
a few technical tools required for the remaining chapters are presented. Along these lines, 
many important theorems are stated without complete proofs, in which cases references are 
given. In fact, derivations of mathematical results are only presented when they are, on 
their own, a revision of a useful concept/tool. In contrast, a rather complete account of the 
theory of convex optimization and semidefinite programming can be found in the book by 
Boyd and Varderberghe [^DJ and in the review article by the same authors [IB"]. Modern 
reviews on the theory of quantum operations can be found in the Ch. 8 of Ref . [^]J , Ch 



10 



and 11 of Ref. 



and Ch. 5 of Ref. 



This chapter is divided as follows. In Sees. [2.2| and 2.3 we review, respectively, the key 



concepts of quantum operations and semidefinite programming. Sec. |2.4| merges the results 
of the previous sections to show that the optimization problems over the set of quantum 
operations typically have a "Semidefinite Programming flavor", which may or may not be 
confirmed depending on the specific details of the control task of interest. 
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2.2 Quantum operations 

Is there a well defined mathematical model for every possible dynamics a quantum system can 
incur? When the system is closed, then the Schrodinger equation provides a widely accepted 
model for quantum evolution, namely, any unitary conjugation of the density operator. The 
question becomes more subtle for open quantum systems. This section gives a short account 
on the dynamical model we adopt in this case. 

In Sec. p.2.1| we review the concept of completely positive maps from a mathematical 
viewpoint. These maps will be shown, in Sec. |2.2.2| , to provide an adequate model for 



describing open quantum dynamics. In Sec. [2.2.3| we present a number of technical results 



related to the set of CP maps and some subsets of interest. 



2.2.1 Completely Positive Maps 

In the most general framework, Completely Positive (CP) maps are certain types of trans- 
formations defined between abstract C*-algebras P^| . For our purposes, it will suffice to 
restrict to endomorphisms of the finite dimensional algebra .Md of d x d complex matrices^. 
In what follows, we define CP maps within this particular framework. 

Definition 2.1. A matrix A £ A4d is positive semidefinite (PSD), denoted A > 0, if A* = A 
and every eigenvalue of A is non-negative. 

Definition 2.2. A linear map V : M. d — > A4d is positive ifV(A) > for every A > 0. 

Definition 2.3. Let fC : M d — > M d and define JC n : M n ®M& — > M n ®M& by K. n = J n <g>/C, 
where X n is the identity map on the elements of A4 n . Then JC is called n-positive if K n is 
positive. A map that is n-positive for all values of n is termed completely positive. 



The definitions above do not make it clear whether positive but not CP maps exist. In 
addition, definition |2.3| does not provide a viable way of confirming a given linear map as 
CP: testing n-positivity for increasing values of n can only be conclusive if a violation is 
found at some stage. 

who provided examples of 



These two points were first addressed by Stinespring [24 



positive maps which failed to be 2-positive and derived a new criterion for CP-ness, stated 



below as Theorem |2.1| . Further developments came by with the work by Kraus [^| , where an 

In 1975, Choi |26| rediscovered Kraus 



alternative CP criterion was obtained, Theorem 
earlier result with an independent proof that led to yet another new criterion, Theorem |2~5 . 
In addition, 



in Ref. |27| Choi proved that for every d > 2 there exists (d — l)-positive maps 
which are not d-positive, but every d-positive map is automatically CP. With this, Choi 
proved it possible to confirm a map as CP with d "n-positivity tests". 



1 This is a C*-algebra of operators on the Hilbert space Hd = C d if each matrix A 6 .Md is seen as an 
operator on C d . The algebra is equipped with the standard operator norm \\A\\ (the largest singular value 
of the matrix A) and the ^-involution is taken as the conjugate-transpose, so that the required C* -norm 
property \\A^A\\ = \\A\\ 2 holds. 
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Theorem 2.1 (Stinespring [24]). A linear map JC : Aid — > Ai d is CP if and only if there 
exists, for some dimension d' , a rectangular matrix V of size d x dd' such that K, can be 
"factorized" as 

JC(A) = V(A® 1 A ,)V ] VAeM d . (2.1) 
Proof. That any map of the form (|2.1|) is CP is immediate. For a proof of the converse, we 



refer the reader to Ref. |23j, pp. 357-361]. □ 



Theorem 2.2 (Kraus P3| , Choi PEfl). A linear map JC : Aid —> Aid is CP if and only if 
there exists K\, . . . , e Aid such that JC admits a decomposition of the form 

k 

JC(A) = K i AK \ VAeM d . (2.2) 

i=i 



Proof. That any map of the form ( |2.2| ) is CP is immediate. To see that every CP map can be 
written in this form, one can simply define the operators Ki such that V^r = ^(-^-l 7 *) ® e ii 
where V is the d x dd' matrix from Eq. ( |2.1| ), r is an arbitrary vector of size d x 1 and {e,j} 
is an orthonormal basis of vectors of size d' x 1. It is then straightforward to show that 
Eq. O reduces to Eq. Q (see, e.g., Ref. @, p. 20] for details). Choi's proof of this 
result follows a different direction and has been nicely revisited by Leung in Ref. □ 



Theorem 2.3 (Choi [p6[|). Let C(Ai d , Aid) denote the set of linear maps from Aid to Aid 
and consider the following map from C(Ai d , Aid) to Aid <8> Aid-' 

JC i-> & = (l d ® JC) ( E U ® E i,j ) > ( 2 - 3 ) 

where each Ejj is a d x d matrix of elements 1 in the position and elsewhere^. 
A linear map JC : Aid — ► Aid is CP if and only if &> 0. 

Proof. That & > if JC is CP follows directly from the observation that ^ - =1 Ejj <8> Ejj is 
a PSD matrix (see footnote 0). The converse, is typically proved by relying upon theorem 



2.2 ; see, for example, Ref. [p0| . □ 



In this thesis, the theorems above will not be used for testing the CP-ness of a given map. 
Instead, Eqs. (|2.ip, (|2.2| ) and ( |2.3| ) (with .8 > 0) are taken to be general representations of 
a CP map. 



2 Clearly, {Eij}f J=1 forms a basis for the so-called Weyl basis. In physics notation, Ejj = |i) 
where {|«)}f =1 is an orthonormal basis for the Hilbert space Hd = C d onto which the matrices of act 
as the linear (bounded) operators of £?(Hd). The (unnormalized) density matrix |^')(^ r | defined in this 



basis via \ty) = ^ i=1 \i) <8> \i) is thus identical to Y^ij=i^i,j an< l Eq. (2.3) is typically written as 

£=(X d g)/C)|#>(*|. 
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2.2.2 Quantum operations as CP maps 

In this section, we argue that a certain class of CP maps yields a suitable mathematical 
model for general state changes in quantum theory — an idea introduced long ago by Kraus 
25 1 . In order to make this precise, we start with a brief clarification of what is meant when 



a quantum operation is said to be a CP map. 

In the Schrodinger picture, a quantum operation can be thought of as a map on the set 
of density matrices of dimension d, «S(Hd). Since this is not a linear space (it is not closed 
under arbitrary linear combinations), a quantum operation is not a linear map, let alone a 
CP map [|3T|. Nevertheless, it is widely accepted (although also debatable, see e.g., Refs. 
[[32], |33| and references therein) that any quantum operation Q satisfies a weaker form of 
linearity called convex- linearity: 

Q[Xp + (1 - X)a] = XQ(p) + (1 - A) Q(a) for A G [0, 1] , (2.4) 

where p, a G iS(Hd). In this thesis we adopt the common practice of saying that Q is a CP 
map if the map obtained from its linear extension to i3(Hd) = Aid is CP. 

Having clarified this subtlety, we now follow justifying why quantum operations are 
suitably described by certain types of CP maps, namely, those that preserve the trace of 
their input. These are called completely positive and trace preserving (CPTP) maps. 



2.2.2.1 Equivalence between quantum operations and CPTP maps 

It is constructive to proceed in three steps: first we discuss why the set of quantum operations 
should lie within the set of positive maps, then we motivate the restriction to the subset of 
CP maps and finally we justify the equivalence with the set of CPTP maps. 



Quantum operations =>- Positive maps. Once Born's rule is accepted as a fundamental 
axiom of quantum mechanics, quantum states must be modeled by PSD matrices in order 
to guarantee that they will always yield well-defined probabilities. As a result, any map 
converting from arbitrary quantum states to arbitrary quantum states must necessarily be 
positive. 



Quantum operations =>- CP maps. Not every positive map can model the dynamics 
of an arbitrary input density matrix. This can be easily understood as follows: Start by 
assuming that every positive map V represents a quantum operation. From a physical view- 
point, this means that an experimentalist should (at least in principle) be able to implement 
such a map in the lab. Obviously, if V can be implemented, there is no reason why the map 
Xd®P could not be built as well, since it corresponds to evolving a bipartite d®d system by 
applying V to the second part and leaving the first part alone. However, as we learnt from 
Stinespring [p^]] , positivity of V does not imply positivity of I^^iV, or in other words, there 
exist input states to Xd <8> V that are not mapped to PSD operators^]. As discussed above, 



3 A standard example arises from d = 2, p any bipartite 2 ® 2 entangled density matrix and V the 
transposition map with respect to the basis of p p3, |35| . 
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this is inconsistent with Born's rule, leading to the conclusion that not every positive map 
can model the dynamics of an arbitrary input state, as claimed before. 

An easy solution is to restrict the set of quantum operations to the set of positive maps V 
for which T&®V is still a positive map. As demonstrated by Choi in Ref. P?|, this restriction 
characterizes the set of CP maps. 

It is worth mentioning an alternative (and more comprehensive) solution. No conflict 
with Born's rule arises if we allow every positive map to be a quantum operation, but under 
the understanding that this is only physically meaningful with respect to some particular 
domains of density matrices. A further step is to let every linear operation mapping density 
matrices to density matrices to model quantum operations. These ideas are advocated and 
developed in Refs. §7|, fj§, g|, [HJ, [rf], . 



Quantum operations CPTP maps. It follows from the statistical structure of quan- 
tum mechanics that density matrices are PSD matrices of unit trace. Because of this, any 
map transforming density matrices into density matrices must be trace preserving^ . This re- 
quirement, along with the CP condition explained above, validates the implication quantum 
operations =>- CPTP maps. 

In order to see that the converse also holds, let us equip the Kraus decomposition of 
Eq. ( j2.2|) with the trace preserving condition Tr/C(p) = Trp. It is a simple exercise to show 
that every CPTP map must satisfy 

Y,K\K t = t A . (2.5) 

i 

We now demonstrate that any transformation of the form of Eq. ( |2.2| ) satisfying Eq. ( |2.5| ) 
can be regarded as the evolution of a part of a larger system which evolves unitarily. Since 
unitary evolution is an obviously valid type of evolution, we conclude that every CPTP map 
describes a valid quantum operation. 

To prove our point, we follow Ref. [2"I|, p. 365]: Let U be an operator such that 



U {ip) |eo) = Yli K-i 1^) \ e i) w here p = is an arbitrary state of the system of interest, |eo) 

is an arbitrary state of an ancillary system E and {|ej)} is an orthonormal basis of E. Using 
Eq. fl2.5|) it is easy to show that U preserves inner-products, i.e., (ip\ (e | WU \<p) |e ) = (i>\<p), 
therefore U can be realized as a unitary operation on the joint system. Moreover, from the 
definition of U and the normalization of the basis {|ej)}, a straightforward computation gives 

Tr E [U(p®\e )(e \)U^} = "£, K iP K h (2-6) 

i 

where Tr# denotes the partial trace over the system E. Since the rhs of the equation above 
is precisely the Kraus decomposition of Eq. ( |2.2| ), we have proved the implication CPTP 
maps quantum operations. 



4 Although CP maps that decrease the trace are used to model stochastic measurements processes, i.e., 
those that are selective of certain specific outcomes. In these cases, the trace of the resulting "density matrix" 
gives the probability associated to the specific measurement outcome. This allows for a more comprehensive 
definition of the term "quantum operation" as any complctely-positive-trace-non-increasing map, while the 
term "quantum channel" is generally reserved for the deterministic subset of trace preserving CP maps. In 
this thesis, though, both terminologies are used interchangeably to refer to CPTP maps. Whenever needed, 
we explicitly write stochastic quantum operations to refer to CP maps that decrease the trace. 
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2.2.3 Some useful results 

In this section we gather a number of standard results related with quantum operations 
and their representations. The intention is to offer the reader a self-contained and concise 
review of tools and concepts employed in the developments to be presented in the following 
chapters. The experienced reader should feel free to skip the material of this section and 
refer to it for notational purposes while progressing through the rest of the thesis. 



2.2.3.1 Entanglement Breaking and Trace Preserving maps 

As argued in the last section, the restriction to set of CPTP maps is necessary and sufficient 
to provide a mathematical description of every physically admissible transformation of an 
arbitrary quantum state. Experimentally, though, it is conceivable that not every CPTP 
can be implemented due to practical (but not fundamental) limitations. From a theoretical 
viewpoint, it is interesting to consider the impact of additional constraints on the set of 
realizable quantum state transformations. In this section, we introduce a relevant subset of 
quantum operations: the set of entanglement breaking and trace preserving (EBTP) maps 

BUSH- 

Conceptually, EBTP maps convert quantum information into quantum information via 
a classically mediated process. Quoting Fuchs and Sasaki HTfl, "the reader should be left 



with the imagery of a quantum state initially living in a large river of Hilbert space, later 
to be squeezed through a very small outlet" , and then discharged in a large river of Hilbert 
space once again. In what follows we give a construction of an arbitrary EBTP map that 
more precisely formulates the above description. 

First, the initial quantum state p is measured with a POVM of elements {P a }^ =1 , and 
depending on the classical outcome a, a pure state \h^) from an orthonormal basis set 
{|^d)}a=i °f the Hilbert space H d is prepared. The output state of such a process is of the 
form 

d 

QC(p)=J2^(P«p)\h a d )(h a d \ . (2.7) 

a=l 



Holevo |43| coined the term quantum-classical channel to designate maps of this form, since 
they express a decision rule transforming quantum states into probability distributions on 
an output alphabet {a}. 

After this, the POVM {{h^) (h^\}^ =1 is applied and a density matrix Qp is prepared 
conditioned on the classical outcome j3. The resulting state is then 

d 

CG(e)=X>( e |/i!)</iJ|)Q^ (2.8) 

(3=1 



Maps of this form are termed classical- quantum channels [43]. With a straightforward com 



putation, the composition of these two maps can be shown to be of the form 

d 

CQoQC(p) = J2Tr( P Pp)Qp (2.9) 

(3=1 
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which is known as the Holevo form — a defining expression of every EBTP map fl45 l. 

Because of the intermediary measurement, any entanglement present between the sys- 
tem evolving under the EBTP dynamics and other degrees of freedom is destroyed. Since 
the CQ channel does not allow these systems to interact again, the outcome of an EBTP 
map is always separable with respect to this partition, motivating the name "entanglement 
breaking" coined in Ref. [45]. 



An important property of EBTP maps is that they form a convex set. This can be 
promptly verified by noting that any convex sum of Holevo forms is still in the Holevo form, 
since any convex sum of POVMs is still a POVM. Every CQ map with a pure density matrix 
Qp is an extreme point of this convex set, however, for d > 3 there are extreme points which 
are not CQ maps. These and other structural characterizations of the set of EBTP maps 
are provided in Ref. fl45] . In Ref. [||| the case d = 2 was studied in detail. 



In the next section we provide a representation of EBTP maps via certain types of matri- 



ces — analogous to the representation of CP maps via the PSD matrices & (cf. Theorem p. 3|) . 
Because this way of characterizing maps is at the heart of most of the developments in this 
thesis, we also summarize similar results for the sets of positive and CPTP maps. 

2.2.3.2 Isomorphisms between maps and matrices 

A key idea to be employed in this thesis is that the problem of searching over a set of maps 
can be recast as searching over a set of matrices. Mathematically, this arises when these 
two sets are related through a bijective correspondence, i.e., they are isomorphic. In less 
technical language, each map in the first set is associated with a different matrix in the 
second set (injection), and the matrices obtained from this relation fully span the second set 
(surjection). In this section, we sketch this isomorphism for the sets of positive, CP, CPTP 
and EBTP maps. An excellent review of the duality between maps and matrices is given in 



Ref. H. 

In 1972, Jamiolkowski showed that the set P d et of positive maps V : £>(H d ) — > £>(H d ) is 
isomorphic to the set ^3^ of matrices ^3 G B(H d ® H d ) such that 

(hS\(4\y\h$)\tf d )>0 for a,/? = l,...,d, (2.10) 
where {|/i d )}a=i is some orthonormal bases of H d . The isomorphism J : V^ ct — > ^3^ is given 

by 

7>^<P = (2k<8>7>)(|tf)<*|) , (2.11) 

where we have defined the unnormalized, bipartite (and maximally entangled) state |\&) := 
EcUi l^d) ® \ h d) e H d ® H d . The reader should recognize the similarity of the map above 
with the one introduced in Theorem ^3] on page [7| (see also the footnote ^| on the same page). 



A few years later, in his landmark 1975 paper [p6| , Choi obtained a analogous result 
between the set /C d et of completely positive maps /C : £>(H d ) — > £>(H d ) and the set 8.^ of 
matrices 8. G £>(H d ® H d ) such that 

(v\&\v)>0 V \v) G H d <g> H d , (2.12) 

that is, the cone of positive semidefinite matrices ^ of dimension d 2 . Once again, the map 
introduced in Eq. ( |2.3j ) - - defined between /Q et and - - gives the expression of this 
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isomorphism. A more recent and very neat demonstration of the isomorphism between /C d ct 
and is given in Ref. [|9|. 

Remarkably, condition (|2.12| ) is more demanding than condition ( |2.10|) , since the set 
of product states \x) <g> \y) is just a subset of the set of states that can be formed in the 
full Hilbert space H d Cg> H d . This is hardly surprising, since complete positivity is a more 
demanding constraint than mere positivity, as discussed in Sec. |2.2.1 . It is curious, though, 
that it is the set of completely positive maps (as opposed to the set of positive maps) that 
is isomorphic to the cone of positive semidefinite matrices. 

For the purposes of this thesis, our main interest is on an isomorphic correspondence for 
the set Q s d et of CPTP maps on Q : £>(H d ) — > i3(H d ). This is because the elements of this set 
model quantum operations, and for reasons that will become clearer later, it will be crucial 
to have a matrix characterization of how quantum mechanical systems can evolve. In Refs. 
pU| , |5*Tf , <2 d ct was shown to be isomorphic to the set Q. s $ of matrices G i3(H d <S> H d ) such 
that 

> and Tr 2 = l d . (2.13) 

Here, can be regarded as an (unnormalized) bipartite density matrix and Tr 2 denotes the 
partial trace over the second subsystem. Clearly, the positivity constraint > arises from 
the CP character of every CPTP map, the partial trace constraint, in turn, follows from the 
additional trace preserving requirement. 

Yet another set of maps of interest is £> d ct , formed by the EBTP maps defined in the 



last section. Even before the characterization of Eq. ( f2.9| ) given in Ref. jRJ , these maps had 
long been employed as CPTP maps whose matrices are always separable across the two 
parties of dimension d [[H], |51|, |52|, |53|, |54]]. That is, £> d et is isomorphic to the set of matrices 
*B d 2* whose elements satisfy 

23 > , Tr 2 23 = l d and 23 is separable across d <S> d. (2-14) 

The matrix elements forming the image of each one of the isomorphisms discussed above 
are obtained from 

d 



<t=(2k®C)(|tt)(tt|) 



Q, /3 = 1 



h a d )(h d \®C(\h« d )(h»\) 



(2.15) 



where C represents any one of the maps V, /C, Q or B while £ corresponds to ^3, or 23, 
respectively. Henceforth, for any linear map C, we shall refer to the matrix £ computed via 
Eq. ( |2.15| ) as the Choi matrix of C. 

On the other hand, Eq. ( 2.15|) can be "inverted" to characterize the action of any linear 
map C from its Choi matrix £ El 

C(q) = Th [(g T ® l d )<£] Vf? e B{H d ) (2.16) 

where the transpose operation T is taken with respect to g written in the basis {|^ d )}a=i- 
To see that Eq. (|2.16| ) actually inverts Eq. (|2.15| ) for an arbitrary linear map C, note that 
the substitution of the latter into the former immediately leads to a tautology: 



C{q) = Tn 



a,P=l 



H\g T \h2)C(\h2){4\)^e(g) 



(2.17) 
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where the linearity of C was used in the last equality. 

A good example of application of Eq. ( |2.16| ) is the straightforward derivation of the 
expression of the trace preserving condition of a CPTP map Q in terms of its Choi matrix 
H. Simple substitution of Eq. ( ^.16| ) into Tr <2(p) = Trp leads to the constraint Tr 2 £l = Id) 
already given in Eq. ( |2.13|) . 

Throughout this thesis, we will often recur to represent quantum operations via their 
Choi matrices. For this reason, some practical mathematical tools for dealing with these 
objects are most welcome. In the next section we introduce some of them while approaching 
a technical problem of interest. 



2.2.3.3 The Choi matrix of a composed CP map and the vec/mat operations 

In this section we solve the following problem: Given Choi matrices & a and of two CP 
maps JC a and JCb, how can we express the Choi matrix of the composed map JCb o JC a as a 
function of &b and £ a ? 

The solution for this problem will be useful later in this thesis. We also present it here 
because it sets the stage for the introduction of some tools for the algebraic manipulation 
of Choi matrices. These will be employed, for example, in the next section, where we show 
that Kraus decompositions and CP maps are not isomorphically related. 

We start by applying Eqs. (|2.15| ) and (|2.16|) to write 

& ba = (J d ® JC b o JC a ) |tf>(# | = (J d ® JC b ) o (J d ® JC a ) |tf>(# | = 

(J d ® JC b ) & a = Tr : ® l da ) I] , (2.18) 

where & denotes the d 4 -dimensional Choi matrix of the map Z d ®/Q, and Tr! the partial trace 
over the first d 2 -dimensional subsystem of its argument. In order to continue from here, we 
need to express ^ in terms of The following short review on matrix vectorization will 
provide adequate tools for the solution of this problem. 



The vec and mat operations. The vec-operator [53, 56] is defined as the transformation 



of any matrix into a vector by stacking the columns of the original matrix. For example, 



vec 



a c 
b d 



( a \ 

b 

c 



(2.19) 



There are a number of useful properties of vec that can be easily verified |56]: For A, B,C E 
M d , 



vec A = (l d ® A) I*) 
vec A^vecB = Tr (A*B) 

vec (ABC) = (C T <8> A) vec B 
vec (A <g> B) = P d 4 (vec A <g> vec B) 



(2.20) 
(2.21) 
(2.22) 
(2.23) 
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where, in the first line, A is defined in the same basis used to define in the last line, P d 4 
is a permutation matrix whose explicit form is worked out in Appendix [A. 1|. It is also not 



difficult to see that for every vector v of d 2 entries, there exists a unique matrix A G Aid 
such that vec A = v. This establishes an inverse operation for vec, which we shall denote by 
mat, i.e., M = mat v. 

Let us now see how this machinery can be of assistance in solving the composition problem 
at hand. First, note the following relationship between the Choi matrix of any CP map /C 
and the Kraus decomposition of Eq. (|2[ 



a = (j d ® k) mm = J2 ® mm (i d ® k 



vec Fj (vec Fj)* 



(2.24) 



where Eq. ( |2.20| ) was used to establish the last equality. 

Now, for sake of generality, define the CP map T := E® T , where E and T are CP maps 
with Kraus operator sets {E{\ and {Fj}, respectively. Using Eqs. ( |2.24p , ( |2.23|) and some 
straightforward algebra, we find that the Choi matrix X (of T) can be written in terms of 
the Choi matrices <£ and 5 (of E and F) as follows: 



X = ^ vec (Ei <g> Fj) [vec (F* ® F,-)] 1 " 



P d 4 [vec Ei <g> vec F,-] (vec F;) 1 " <8> (vec F,- 



F d 4 



vec Ei (vec Ei) ] <g> ^ vec F,- (vecF,-) 1 " 



F d 4 
F d 4 



F d 4(€®^)F d 4. 



(2.25) 
(2.26) 

(2.27) 
(2.28) 



This result and the trivial fact that the Choi matrix of the identity map is can be 

used to write & in terms of Kb, 



£ = P d 4(|^)($|®£ 6 )P d 4, 

which substituted into Eq. (|2.18 ) leads to 

& ba = Tn ® l da ) F d 4 (|^)(^| ® ^) F d4 ] 



(2.29) 



(2.30) 



Since both F d 4 and |^ r )(^ r | are fixed matrices for a given dimension d, the equation above 
gives &ba solely in terms of & a and £&, as required. 



2.2.3.4 Non-uniqueness and construction of Kraus decompositions 

The goal of this section is to show that, contrary to the one-to-one relationship between 
CP maps and Choi matrices, the relationship between CP maps and Kraus decompositions 
is one-to-many. Moreover, we describe a practical method to construct any set of Kraus 
operators from the unique Choi matrix of a given CP map. As a byproduct, we will find 
that every CPTP map acting on a d-dimensional system can be written in the Kraus form 
with no more than d 2 Kraus operators. 
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Let us start by checking that a given CP map can be decomposed in the Kraus form in 
(infinitely) many different ways. In order to do so, we show that if Kj is regarded as set of 
Kraus operators of a CP map /C, then so is the set Ai defined by 

Ai-^^UijKj, (2.31) 

j 

where the complex numbers Uij represent the matrix elements of an isometry U. This result 
has a long history, being firstly stated in a closely related form by Schrodinger in 1936 |5"? 



and ever since rediscovered numerous times |58| , p9| (see Ref. |Kj for a both technical and 
historical review). 

It suffices to show that the set Ai gives rise to the same Choi matrix A as the set Kj. 
Let 21 denote the Choi matrix of the map generated by Ai, from Eq. ( |2.24j ) we have 

21 = vec At (vec Ai) = I u ij u i,k I vec Kj ( vec Kk) — vec Kj ( vec Kj) — & ? 

i j,k \ i / j 

(2.32) 

which establishes the claimed result. Notice that in the last line we have only used the 
fact that Ylii u i,j u *ik = $k,j, which is a common property of any isometry. Since there are 
infinitely many different isometries to choose from, infinitely many sets of Kraus operators 
for the same quantum operation can be generated from Eq. ([2.31|) . Actually, by varying over 
all isometries, Eq. ( |2.31| ) gives rise to every possible set of Kraus operators of a fixed CP 
map K (see [^TJ p. 372] for a proof). 

A closely related question is how a set of Kraus operators can be constructed from a given 
Choi matrix. In what follows, we present a canonical procedure that, on its own, reinforces 
the idea that there are (infinitely) many different choices of Kraus operators for a given CP 
map and clearly demonstrates that every CP map can be specified with a set with no more 
than d 2 Kraus operators. 

The fact that any Choi matrix ^ is a d 2 x d 2 PSD matrix can be expressed via the 
equality |HJ 

R=S*S, (2.33) 

where S is a matrix of dimensions d' x d 2 for arbitrary d'. A set of Kraus operators for a CP 
map K, can be obtained from A by reshaping the columns of into matrices, as follows: 

Ai = mat [coL. for i = l,...,d', (2.34) 

where colj is the operator that extracts the i-th column of its matrix argument. Because 
there are many different choices of matrices S that decompose ^ in the form of Eq. Q2.33 ), 



many different Kraus decompositions can arise from Eq. ( j2.34| ) — this is just a different way 
of observing the already demonstrated one-to-many correspondence between Choi matrices 
and Kraus operators. The Cholesky factorization of ^ f62 |, for example, can always be 



employed to yield the unique upper triangular matrix S with non-negative diagonal entries 
that satisfy Eq. (|2.33| ). In this case, we have d' = d 2 , which guarantees that it is always 
possible to decompose a CP map with no more than d 2 Kraus operators. In fact, the minimal 
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number of Kraus operators with which a CP map can be decomposed is equal to the rank 
of the corresponding Choi matrix 



It is straightforward to verify that formula ( |2.34| ) gives a valid set of Kraus operators for 
/C. Next, we do that by recovering the Choi matrix 8. when the above defined Ai are used 
in Eq. (fZTM): 



vec Ai (vec A;)^ = vec mat [cob (S^)] {vecmat [cob O^)]} = 

i i 

J] cob {&) [cob (5 t )] t =&S = & (2.35) 



2.2.3.5 Basis for hermitian matrices 

By construction, every Choi matrix is a hermitian matrix. In many cases, it will be helpful 
to expand a Choi matrix in some fixed basis of hermitian matrices and characterize the 
associated map by the real coefficients of the expansion. In this section we set the conventions 
of the basis we will employ, as well as derive the expansion of an arbitrary CPTP map with 
respect to it. 

We shall denote by {H£ }^=i an Y set of hermitian matrices forming a basis for the space 
of hermitian matrices and satisfying the following properties: H\ = Id is the only element 
of the set with a non-zero trace, that is Tr = d5 a> i. Moreover, we would like to think of 
the remaining matrices as higher dimensional generalizations of the Pauli matrices for d=2, 





1 






< -i \ 


.1=1 




; ■ 


02 = 1 


< i J 



and a 3 = ( J ^ V (2.36) 

Since the Pauli matrices satisfy Tr ((TaCTp) = 25 at p, for a, [3 — 1, . . . , 3, we shall require this 
same orthonormalization property for H%, i.e., Tr (^H^H^J = 25 a ^ for a, /3 = 2, . . . , d 2 ; or, 
accounting for the convention H\ = 1^, 

Tr = 8 a , p [dSp,! + 2(1 - 6 Pil )] , (2.37) 



for a, (3 = 1, . . . , d 2 . For d = 3, the identity matrix and the Gell-mann matrices |63| provide 
a possible construction of the basis set {H^}^ =1 . For larger dimensions, we have used the 
generators of SU(d) (plus the identity matrix) as a basis. We note that while our generalized 
basis is still hermitian and orthogonal like the Pauli matrices, the matrices H£ are generally 
not unitary for d > 3. 

We now move to characterize the Choi matrix of a CPTP map Q acting on a d- 
dimensional quantum system. As discussed before, H will be a matrix of dimension d 2 . 
It will be convenient to expand it in a tensor product basis of the form 

d 2 

&=Y1 (2-38) 
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Since this represents the Choi matrix of a trace preserving map, we must have that 
Tr 2 = Id- Clearly, this matrix constraint can be rewritten as several scalar constraints, as 
follows: 

Tr[(Tr 2 a.)H$] =TrH% for a = 1, . . . , d 2 . (2.39) 



From the definition of the partial trace [21, p. 107] and the properties of H£ established 



above, we can even rewrite the above as follows 

Tr [0 (H% ® l d )] = d5 a ,i for a = l,...,d 2 . (2.40) 



By substituting the expansion of Eq. ( p.38|) in the equation above, and solving the resulting 



set of equations for the coefficients x aj p, we find that x a ,i = 5 a ,i/d for all a = 1, . . . ,d 2 . 
This condition, used in Eq. (|2.38|) , gives rise to the following general expansion of any Choi 
matrix of a CPTP map: 

a=l 
0=2 

Of course, many other constraints apply on the coefficients x a ^ in order to guarantee that 
> 0. In the next section, we shall explicitly look at them in the case of d = 2. 

2.2.3.6 CPTP maps on M 2 

Up to here, we have surveyed a few properties of CPTP maps that are valid independently 
of the finite dimension d of the matrices onto which they apply. In this section, we review 
some important results which are peculiar to the case d = 2. These will be important in our 
study of qubit state transformations in Ch. |5], ^ and |7|. Most of the results discussed here 
were obtained by Ruskai et al. in Refs. [0, |65|j . 



Diagonalizing CPTP maps on qubits When we restrict the action of the CPTP maps 
to 2 x 2 density matrices, there is a nice alternative representation of these maps due to 
King and Ruskai [Q which, among many other things, enhances our intuition on how such 



maps transform the state of a single qubit. The derivation of this representation closely 
resembles a diagonalization procedure and is, in essence, the application of the singular 
value decomposition to some matrix formed from the expansion coefficients of the Choi 
matrix, as discussed above. In what follows, this procedure is presented in detail. 

Let be the 4x4 Choi matrix of a CPTP map Q on a single qubit state. Then, from 
Eq.( |2.41 ) we have 



= il 4 + X^(^®o- z ) ■ ( 2 - 42 ) 

j=0 

i=i 

where are the (real) expansion coefficients and we adopted the convention a = 1 2 . 

We now give a general (and convenient) form for the input state of Q. Since any density 
matrix is necessarily hermitian, we can write 



2p = ct 2 + R ■ o- , 



(2.43) 
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where c6K and R G R 3 is the vector formed from the expansion coefficients of p (multiplied 
by two) for the basis elements <Ti, a 2 and 03, in such a way that Eq. ( |2.42| ) is an ordinary 
expansion on the Pauli basis. Besides being written in more compact notation, Eq. (|2.42|) 
highlights the special significance of the vector R as the Block vector of the state p. Because 
the Pauli matrices are traceless and density matrices have unit trace, we should set c = 
Trp = 1. However, for later use, it will be convenient to proceed leaving c unspecified. 

With the aid of Eqs. ( |2.16 ), ( |2.42|) and ( |2.43| ), the action of the map Q on p can be 
written in terms of the coefficients qji as 



2Q(p) = ct 2 





( 2g i ^ 






2g 2 


H 




V 2g 03 j 





2q n -2q 21 2q 31 
2q 12 -2q 22 2q 32 
2gi3 -2g 2 3 2g 33 



R 






R 


I)} 




R 




\ Or 3 



(2.44) 



We now apply the singular value decomposition to the 3x3 matrix above. Because q^ G R 
for all j and I, the unitary matrices arising from the SVD can be chosen to be real orthogonal 
matrices, 



2q u -2q 21 2q u 
2q l2 -2q 22 2q 32 
2gi3 -2g 2 3 2g 33 



P! 
p 2 
p 3 



Oj = R 2 



P! 

p 2 
p 3 



Rj, (2.45) 



where Oi,0 2 G 0(3, R) and P{i, 2 , 3 } > 0. The second equality expresses the fact that every 
element of 0(3, R) is either a rotation [i.e., an element of 50(3, R)] or the product of a 
rotation with the inversion — 1 3 . As such, the equation is valid for some Ri,R 2 G 50(3, R) 
if we absorb the sign of a possible inversion in the coefficients /i{i,2,3}- As a result, we have 
|^{i,2,3}| = AHi j2) 3}, but the matrix diag [pi, p 2 , p^] is not necessarily PSD. Applying this 
decomposition to Eq. ( |2.44j ), we find 



2Q{p) = cl 2 + R 2 




p x 
p 2 
p 3 





(2.46) 



which reveals the CPTP map Q as an affine transformation of the Bloch vector. This can 
be interpreted as follows: First, the Bloch vector is rotated by Rj, then, the x— , y— and z— 
components of the rotated vector are rescaled by p 1} p 2 and p 3 , respectively. Subsequently, 

/ 2g i \ ' 

the constant vector cs with s := Rj I 2go2 is added and finally another rotation of the 

' \ 2g 03 / 

resulting Bloch vector by R 2 is performed. This sequence of transformations is illustrated in 
Fig. |3 

Because every rotation of the Bloch vector corresponds to a unitary transformation of 
the density matrix, we can write 



Q(p) = UV(VpV ] )U ] 



(2.47) 



where V is the unitary associated to the rotation Rj, U is the unitary associated to the 
rotation R 2 and T> is said to be a diagonal CPTP map on the Pauli basis, implementing the 
rescaling and the translation explained above. 
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Figure 2.1: Affine transformation on the Bloch sphere due to a CPTP map 



The allowed coefficients Mi, 2,3 an d £1,2,3. From the results of Sec. 2.2.3.2, not every 
real value of coefficients qji substituted in Eq. ( |2.42j ) will lead to a valid CPTP map Q. In 
order to produce a sharp characterization, it is necessary to specify every possible choice of 
coefficients that turn H into as a PSD matrix. This (almost heroic) task was accomplished 
by Ruskai, Szarek and Werner ||65|| , who proved that any map Q in the form of Eq. ( |2.46|) is 
CPTP if and only if the following inequalities hold 

(/.i + < (i + „ 3 ) 2 -4- (4 + si) ( ] + fl3 t S3 ) > ( 2 - 48 ) 

V 1 - M3 ± S3 ) 

1 - fX 3 ± S3 
1 + fJL 3 ± S3 



- (4 + 4) ( \ , 7X3 ) > ( 2 - 49 ) 



[1 - (//? + £ + //i) - {4 + 4 + 4)} 2 > 

4 ( S 2 + fty + ^ ^2 + ^2^ + ^2 ^2 + ^ _ 2fJllfl2fJl3 ] . (2.50) 



In terms of the representation of Fig. |2.1| , these inequalities establish that not every 
ellipsoid internal to the Bloch sphere can be obtained from the action of a CPTP map; 
furthermore, they precisely characterize which ellipsoids can be obtained in such a way. 

Remarkably, the inequalities above are saturated for the extreme points of the set of 
CPTP maps |65|| . This turns out to be equivalent to the conditions si = S2 = and 



M3 = M1M2 and S3 = (1 - /4)(1 - 14) (2.51) 
which leads to the useful trigonometric parametrization of the extreme points: 

iii = cos u, fi2 = cosv, /J.3 = cos u cos v , S3 = sin u sin v (2.52) 



with u e [0, 27r) and v G [0, tt). 
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A representation for the Choi matrix of a general CPTP map on a single qubit 

Throughout, it will be convenient to express the Choi matrix of the CPTP map Q in 
terms of the unitaries U, V and the scalars /i{i j2j3 } and £{1,2,3} introduced in the last section. 
Here, this representation is derived in an elementary way. 

Our first step, is to rewrite Eq. (|2.47|) in terms of the Choi matrices £3 and 23 of the maps 
CPTP Q and 23, respectively. For {D{\ a set of Kraus operators of T>, we have 



(X 2 ® Q)|*)<*| 


(2.53) 


(la ® UDiV) |#>(tf| (1 2 ® UDiV)* 

i 


(2.54) 


5^ vec (UDiV) [vec (UDiV)? 

i 


(2.55) 


( yT ® ^) vec A (vec A) + (^ T ® ^ 


(2.56) 


(^ T ®t/)s) (v t ®uy , 


(2.57) 



where in the third, fourth and fifth equalities were obtained from the application of Eqs. fl2.20|) , 
fljjp and ( gjg) , respectively. 

Eq. ( |2.57|) makes it clear that if we can express S3 in terms of ^{1,2,3} arid £{1,2,3}, then 
the desired formula for £3 can be obtained by simple unitary conjugation. Our next step is 
then to obtain this formula for S3. 

Since |^ r )(^ r | can be written in terms of the Pauli matrices as (1 4 + G\ ® o\ — a 2 <8> cr 2 + 
03 <8> cr 3 )/2, we have 

213 = 2{I 2 ®V) 

= 1 2 ® X>(1 2 ) + o-i ® X>(o-i) - cr 2 ® X>(<r 2 ) + ^3 ® ^(a 3 ) 

= 1 4 + 1 2 <g> (S ■ &) + /ilCTi <g> £7l - /i 2 0"2 <8> CT 2 + /i 3 £7 3 ® <7 3 , (2.58) 

where, for the last equation, we used Eq. ( |2.46| ) with Ri = R 2 = 1 3 to find that the 
action of T> on the Pauli basis is given by Z>(1 2 ) = 1 2 + s ■ a and T>(a k ) = fJ>k&k, for 
k = 1,2,3. Combining Eqs. ( |2.58| ) and ( |2.57| ), a formula for in terms of U, V and the 
scalars /i{i,2,3} and £{1,2,3} can be promptly obtained. Next, this formula is presented in terms 
of two orthonormal sets of real vectors, {vk}k=i,2,3 and {v,k}k=i,2,3, that specify the unitaries 
according to 

v k -cr:=V^a k V and u k ■ cr := Ua k U^ . (2.59) 

or more explicitly 

v k = \ (Tr [VVfcVtTj , Tr [V^a k Va 2 ] , Tr [V^a k Va 3 ]) , (2.60) 
u k = \ (Tr [Uautfai] , Tr [Ua k U ] a 2 ] , Tr [Ua k U^a 3 ]) . (2.61) 

In terms of these versors, we have 



3 3 
213 = 1 4 + ^2 Sk ^ 2 ®(u k -er) + ^2 {v k ■ cr T ) ® (u k ■ <x) 

k=l k=l 



(2.62) 
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From this equation and the above discussion, it should be clear that the specification of the 
six versors Vk and and the six scalars \xy. and s&, fully specify any quantum operation on a 
single qubit. In fact, it suffices to specify only two versors of each set, since for an arbitrary 
unitary matrix W and versors related to W as in Eq. ( [2.59D , we have Wk xwi = Ekim w m, 
where Ekim is the Levi-Civita symbol, and (k, /, m) is some permutation of (1, 2, 3) []. 



2.3 Semidefinite Programming 

In this section we introduce and study a class of optimization problems denominated Semidef- 
inite Programs. Although this is a widely developed chapter of convex optimization theory, 
our intention here is not to provide a comprehensive revision of the field (see [[18], ^0] for 
that purpose), but to concisely present the key aspects to be explored in the remainder of 
this thesis. 



2.3.1 Definition and common forms 

A Semidefinite program (SDP) is any optimization problem that can be written in the form 

maximize — Tr (E Z) 

subject to Z > (2.63) 
Tr (EiZ) = k, for i = 1, . . . , n (2.64) 

where the matrix Z is the variable of the problem, hi are given real numbers and Eq, Ei 
(i — 1, . . . , n), are given hermitian matrices. Any SDP written in the form above is said to 
be in the standard form. 

The name semidefinite program is motivated by the matrix inequality constraint Z > 
0, which implies that only positive semidefinite matrices Z are acceptable as a solution 
of (|2.63|) . Apart from this restriction, the n equality constraints Tr (EiZ) = hi completes the 
characterization of the feasible set of problem ( |2.63| ). This is a convex seiQ, since it is the 
intersection between the cone of PSD matrices and a number of (hyper) planes determined 
by the equality constraints, each of them being a convex set on their ownQ. 

The objective function — Tr (EqZ) is a linear function of Z, and as such is simultaneously 
concave and convex^. This simple observation — combined with the convexity of the feasible 
set — enable us to recognize SDPs as a special type of convex optimization problem^. This 

5 To see that, simply use these two well-known properties of Pauli matrices: (i) o^oi = ieuim^m [for 
every k ^ I and m such that the sequence (k,l,m) is a permutation of (1,2,3)] and (ii) (a • er)(b ■ cr) = 
(a • b)l 2 + i(a x b) ■ cr. 

6 A convex set is defined as the set for which every convex combination of its elements is still an element 
of the same set. 

7 Recall that the intersection of convex sets is another convex set. 

8 A convex function is any function / that satisfies f(px\ + (1 — p)x 2 ) < pf(xi) + (1 — p)f(x-z) for every 
x\ and X2 in the domain of / and p £ [0,1]. A function / is concave if the reversed inequality is satisfied, or 
cquivalently, if — / is convex. 

9 A convex optimization problem consists of the minimization of a convex function over a convex set (or 
cither the maximization of a concave function over a convex set). 
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is an important realization since in a convex optimization problem every local optimum is 
automatically a global optimum, therefore, numerical methods for solving SDPs do not get 
stuck at suboptimal solutions. 

There is another common presentation of a SDP that can be derived from the standard 
form as follows. Without loss of generality, we can expand Z on a basis of hermitian matrices 
(this is always possible because Z is constrained to be PSD, hence it is necessarily hermitian). 
Explicitly, if Z is a matrix of dimension d, then we can write Z = ^2 a y a H^, where H% 
(a — 1, ... , d 2 ) are the elements of the hermitian basis (e.g., those described in Sec. |2.2.3.5|) 



and y a are the d 2 real coefficients of this expansion. Problem ( |2.63| ) then assumes the form 



d 2 

a=l 




d 2 
a=l 




d 2 
a=l 


, for % = 1, . . . , n. 



(2.65) 



The equality constraints in the last line form a linear system of n equations and d 2 variables, 
which can be solved to reduce the number of unknowns. Incorporating the solution of 
this linear system to the objective function and to the matrix inequality constraint, it is 
straightforward to see that the resulting problem is of the form 

minimize c T x 



subject to F + x jFj > for j — 1, . . . , m (2.66) 

j 

which is the so-called inequality form of a SDP. In this formulation, the variables of the 
problem are the entries of the vector x, while the real vector c and the hermitian matrices 
F Q and Fj are given. 



2.3.2 Lagrange Duality 

A useful tool for the solution of a SDP is the idea of Lagrange duality, which is basically 
a recipe to construct another optimization problem — the dual problem — whose solution 
bounds the solution of the original problem — the primal problem. The idea of Lagrange 
duality is not exclusive of SDPs, but it is particularly useful in this context because the dual 
of a SDP is another SDP, and the bound it provides on the solution of the primal SDP is 
usually tight. 



2.3.2.1 The dual problem 

Let us introduce the method of Lagrange multipliers to find the dual problem of prob- 
lem ( |2.6tj| ) — our primal problem. The key idea is to construct an upper bound to any 
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feasible value of the primal by augmenting its objective function with a weighted sum of the 
constraint, as follows: 



v) 



max | - Tr (E Z) + Tr (AZ) + v i t Tr ( E i Z ) ~ b i\ j > ( 2 - 67 ) 



where the matrix A and the scalars are the weights, commonly called Lagrange multipliers 
or dual variables. 

Clearly, if Z is a primal feasible point, then the last sum vanishes and the term Tr (AZ) 
is non-negative if A > 0, in which case g(A, v) is larger than any primal feasible value. 
The Lagrange dual problem (or simply the dual problem), consists of obtaining the tightest 
possible bound from Eq. (|2.67p , that is 



minimize g(A,v) 

subject to A > (2.68) 

If we use ~ to designate a feasible point and * to designate an optimal point, then the 
above construction implies that 

- Tr (E Q Z) < - Tr (E Z*) < g(A*, u*) < g(A : u) , (2.69) 

a relation know as weak duality. We shall defer until next section a discussion regarding the 
usefulness of these inequalities. 

We now show that the dual problem ( |2.68|) is, in fact, a SDP. First, note that the dual ob- 
jective function (|2 . 6 7|) can be rewritten as g (A, v) = — b T v + maxz Tr [(A — E + J2 i ViE^Z], 



from which it is clear that 



9{Kv) = { - bT " if A = ^-5>^ (2.70) 
oo otherwise. 



As a result, the dual problem (|2.68|) becomes 

minimize — b T u 

subject to Eq — ViEi > for i = 1, . . . ,n (2-71) 

i 

which is the inequality form of a SDP in the variable v. 



2.Z.2.2 Weak and strong duality 

We have just seen that, by construction, the dual problem imposes the ordering of Eq. ( |2.69|) 
between feasible and optimal values of the primal and dual problems. In this section, we shall 
discuss how this observation is useful, and give conditions under which a stronger relation 
can be obtained. 

For sake of notation, let us start restating the weak duality relation from Eq. ( |2.69| ) as 
follows: 

p<p*<d*<d, (2.72) 
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where p and d represent any feasible values of the primal and dual problems, whereas p* and 
0* give the optimal values of each problem. 

Weak duality is an invaluable tool for certifying that "conjectured optimal solution" 
is actually optimal. To see how this works, suppose that some physical problem can be 
formulated as a SDP. Thanks to physical intuition and/or pattern recognition from numerical 
analysis, one can tailor an educated guess as to what is the general analytic solution of the 
problem. As long as the guess is primal feasible, we can promptly compute a value of p, and 
the problem boils down to decide whether or not p = p*. 

For that purpose, we can start by writing the dual problem. Sometimes, the dual is 
simpler than the primal and a formula for 0* can be rigorously derived. In general, though, 
along the way of "solving the dual" we may need to make some hand-waving assumptions 
to proceed. The whole point is that if after all the assumptions we find a formula for d 
that matches our p, the problem can be considered optimally solved because the only way 
to reconcile d = p with the weak duality relation is to have p = p*. 

But, what if the solutions do not match? In this case, there are three (possibly co- 
existing) possibilities: 1) the guessed p is not equal to p*, 2) the assumptions made in the 
dual side were not good and yielded some V ^ d*, and/or 3) the optimal duality gap d* — p* 
is not zero. 

For many primal problems, it is possible to eliminate the third possibility from the above 
list, that is, many SDPs can have the strong duality property p* = d* guaranteed in advance 
by the so-called constraint qualifications. A very useful and simple-to-check type of constraint 
qualification is the Slater condition, which states that for any convex problem we have strong 
duality if there is some feasible point for which the inequality constraints are satisfied with 
strict inequalities. It is generally easy to find some trivial point (e.g., the identity matrix) 
that satisfies Slater's condition in the case of SDPs. 

Strong duality is also the underlying basis of most numerical methods for the solution of 
SDPs [[LJ| [20|, . These methods iteratively generate feasible points of the primal and dual 
problems that go towards vanishing the duality gap d — p. In practice, a small tolerance for 
the largest acceptable value of d — p is provided, and the algorithm runs until it is achieved. 
When that happens, the generated feasible points of the primal and dual problems are (up 
to numerical precision) their optimal solutions. In practice, the interior-point algorithm 
generally converges after a number of iterations between five and fifty |Tj|. Furthermore, a 
worst-case analysis reveals the theoretical complexity as a polynomial in the variables spec- 



ifying the problem size [66]. In essence, numerical solutions of SDPs can be very efficiently 



obtained on the basis of strong duality. 



2.3.2.3 Complementary Slackness 



For SDPs exhibiting the strong duality property, there is a very neat and useful relation 
between the optimal solution of the primal problem, Z*, and the matrix E — £\ v*Ei 
from the inequality constraint of the dual problem. This relation is called complementary 
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slackness, and is obtained as follows: 



Tr (E Z*) =-J2 W " Tr (E Z*) = - ^ Tr 



Tr 



£ 



E 



Z* = , (2.73) 



where the last implication follows from the fact that both Eq — £V z^-E 1 ; and Z* are PSD 
matrices according to the constraints of the dual and primal problems, respectively. 

Going on with the idea from the previous section of obtaining an optimality certificate for 
a candidate primal solution, the complementary slackness relation can be of assistance in the 
following sense: instead of attempting to solve the dual problem, we can solve the set of linear 
equations (Eq — v^E^Z = for the variables i/j, with Z our guess of the primal problem. 
If the resulting values z/j give rise to a matrix E — £\ z/ji^ that is positive semidefinite, then 
we have guaranteed that the candidate is optimal; otherwise, strong duality guarantees that 
this is not so. Of course, proving positive semidefiniteness of an analytical matrix can still 
be a challenging task. 



2.4 Optimal Quantum Control from Semidefinite Pro- 
grams? 

In this section we begin to formalize the quantum control task of converting a sequence 
of source density matrices into a sequence of target density matrices via a single quantum 
channel, as illustrated in Fig. [Ll| 

We proceed by first motivating the convertibility problem in the context of quantum 
information. Then we give some first steps towards a mathematical formulation of the 
problem. Although a complete formulation will be only achieved in Ch. the goal here is 
to introduce the basic technical ideas and to illustrate the adequacy of the SDP formalism 
for the optimization of quantum operations. 

2.4.1 Why transforming between sequences of density matrices? 

In quantum information science, the problem of transforming between sequences of den- 
sity matrices is relevant because it immediately connects to the problem of transforming a 
quantum system whose initial preparation is uncertain. Many "no-go theorems" for trans- 
formations of uncertain inputs are known, as well as quantum machines that attempt to 
implement these transformations "as well as possible" . 

In general, the construction of optimal quantum machines is conducted in a ad hoc basis, 
envisaging a specific type of impossible transformation. Adopting this approach, optimal 
schemes for discriminating between non-orthogonal quantum states, cloning, purifying, error 
correcting, etc. have been designed. The realization that all of these problems can be 
phrased in terms of transformations between sequences of density matrices leads to a more 
general problem that encloses the fundamental barriers imposed by quantum mechanics, and 
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whose solution provides a generalized optimal machine that can be applied in many different 
circumstances. 



2.4.2 A special case: single-state transformation 

As a first exposure to the problem of converting between sequences of density matrices, 
consider the simplest case where both source and target sequences have a single element 
(7 = 1). This will lead to a trivial but instructive solution. Our aim is then to determine 
a quantum channel that transforms a given d-dimensional density matrix p into another 
d-dimensional density matrix p. In what follows, we prove (by construction) that a channel 
implementing this conversion always exists, regardless of the specific details of the states p 
and p. 

Let {\j)} be any set of pure states forming a resolution of the identity, i.e., ^ . = Id- 
For any d-dimensional density matrix p, we can write 

P »,./){./ , (2-74) 

3 

for some set of real coefficients a,- such that Ylj a j = 1- Now, let the matrices A^ k = 
y/a] \j)(k\ represent the Kraus elements of a map Q, as follows: 

Q(Q)=Y J A hk QA\ k . (2.75) 

That Q is a CP map is immediate from its construction via a Kraus decomposition. Fur- 
thermore, Q is also trace preserving: by exploiting the unit-sum property of the coefficient 
Oj and the fact that resolves the identity, it is easy to check that k Aj k Aj^ k = l d . 

A straightforward calculation then shows that Q perfectly converts p into p: 



P ■ 



Q(P) = E \J) ( k \ PV^J l*> 01 = E °i k'> 01 ( k \ P \k) = pTrlpJ2 |*> <*| 

(2.76) 

where, in the last equation, we have used the completeness relation for the sum and the 
normalization of the density matrix p. 

Remarkably, the set of Kraus operators A^ k of Q is independent of the initial state p, 
but exclusively defined in terms of p. From a physical viewpoint, this means that Q does 
not literally transform p into p, but instead constructs ~p "from the scratch" , by completely 
ignoring the original state p. The possibility of neglecting available information and yet im- 
plementing the conversion with arbitrarily high precision is not a feature inherited by more 
general convertibility problems. In fact, as shown in Appendix |D.1| , a theorem by Alberti and 



Uhlmann sets necessary conditions for the existence of a quantum channel accurately trans- 
forming between pairs of density matrices. Moreover, even when such conditions are met, 
the resulting channels usually depend on the details of both source and target states. The 
derivation of these channels is one of the main objectives of this thesis. In the next section, 
we give the first steps towards their determination via the solution of certain optimization 
problems. 
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2.4.3 The general case: multi-state transformation 

In this section, we start to assemble "mathematical devices" that take as inputs the sequences 
of source and target density matrices and outputs a channel of a certain type that implements 
the conversion between them. In the case where the desired transformation turns out to be 
physically impossible, such a device is constructed to output (i) a channel that optimally 
approximates the unphysical transformation and (ii) a number that quantifies the quality of 
this approximation. 

A general realization of the "device" we are talking about is an optimization problem of 
the form 

nun (®[C{ Pi ),n) , (2-77) 
cec d 

where Q et specifies a family of maps formed by all the admissible controllers C, and (@ [C(pi),Pj\) 
is some notion of distance between the sequences [C(pi)] i=1 and [pjj =1 . Well suited choices of 
(®) should yield a solution C of problem ( |2.77| ) such that (@) vanishes whenever the target 
sequence can be approximated with arbitrary accuracy via an element of Q ct . When that is 
not the case, the resulting operation C yields a non-zero value of (@!) which is interpreted as 
an estimate of the quality of the best available approximation. Possible measures (@) will 
be constructed and discussed in the next chapter. 



Strictly speaking, problem (|2.77|) represents a large class of optimization problems labeled 
by Q et and In the remainder of this chapter, we let be arbitrary and focus on the 
subclasses of problems where the feasible set Q et is taken to be either the set of CPTP 
maps, Q d et , or the set of EBTP maps, B^. In particular, aiming for efficient solution of 
these problems, we attempt to make the constraints C G Q d et and C G £> d et as similar as 
possible to those constraints appearing in the standard and inequality forms of semidefinite 
programs. 



2.4.3.1 The constraint C G Q d et (CPTP maps) 
Thanks to the isomorphism expressed by Eq. ( |2.13| ), the constraint of problem ( |2.77| ) with 

"»set (O se1 



Cj et = Q d et can be written in terms of the Choi matrix £ as follows 



minimize ( @ [C (p, ) , p { ] ) 

subject to € > (2.78) 
Tr 2 £ = l d . 



Furthermore, employing the same reasoning used to derive Eq. (|2.40 ), the equality constraint 
Tr 2 £ = Id can be rewritten to give 

minimize (@ [C (/>»), pj) 

subject to £ > (2.79) 
Tr[£(fl-f ®l d )] =dS aA for a = l,...,d 2 , 

where {H^ }^ =1 is an orthonormal basis of hermitian matrices of dimension d, as defined in 
Sec. |2.2.3.5| . 

Clearly, the constraints of the optimization problem above closely resemble those of a 
SDP in the standard form [cf. Eq. fl2.63| )l: while the Choi matrix €. plays the role of the 
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matrix variable Z, the equality constraints are identical to those of Eq. ( [2. 63] ) under the 
identifications E a = H$ ® l d and b a = d5 a> i. 

Following Eq. ( |2.41| ), we can also explicitly solve the equality constraints to find 

minimize ( 3> [C (p i ) , p J ) 
1 d2 

subject to -j- + x ^pH2 <g> iff > . ( 2 - 80 ) 

a=l 

13=2 

Now, the optimization constraint resembles those of a SDP in the inequality form [cf. 
Eq. ( |2.66| )1: the variables are the real coefficients x a ^ that expand (£ in the product basis 
{if ^ ® -^d }« e=n wnue the matrix inequality of Eq. (|2.66|) is recognized via the identification 
F = l d2 /d and F afi = H% ® H%. 

2.4.3.2 The constraint C G B s f (EBTP maps) 



If we make C| ct = £| et in problem ( p. 77 ), then the resulting optimization problem can be 



written as problems fl2.79p or fl2.80p with the added constraint of <£ being separable across 
the partition d <S> d, as explained in Sec. |2.2.3.2j . Given the types of constraints occurring 
in a SDP, we are led to ask whether the separability of a bipartite matrix can be expressed 
with a finite number of linear matrix inequalities. Fortunately, the search for separability 
criteria is a currently active research field; unfortunately, a complete answer to our question 
is still an open problem. 

In a seminal paper by the Horodecki family [Q, it was proved that a bipartite PSD 
matrix A of dimension did2 is separable across the partition di ® d2 if and only if 

(l dl ®V)A>0 (2.81) 

for every positive map V acting on M.a 2 - Since the condition above is trivially satisfied by 
every CP map, it has only to be required for positive maps V which are not completely 
positive (PnCP maps). A standard example of such a map is transposition. 

In Ref. @], Peres showed that for 2 ® 2 systems, the condition (X dl ® T)A > — or in 
more standard notation, A Ti > — is not only necessary, but also sufficient to ensure the 



separability of A across di ® d2- The same conclusion was independently drawn in Ref. |35 
where the sufficiency clause was extended to 2 £g) 3 and 3 <S> 2 systems and, sadly, shown not 
to hold for larger dimensions. 



For our purposes, this result provides the following equivalent expression of problem ( [2.77 



when the elements of source and target sequences are qubit density matrices (d = 2) and 

qct = gset. 

minimize (@ [C(pi), pj) 

subject to € > , <£ T2 > (2.82) 
Tr 2 £ = 1 2 , 

which is just a restatement of the optimization problem Q2.78|) with the addition of the 
positive partial transpose (PPT) condition. Once again employing the expansion of £. from 
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Eq. ( |2.38 ) and solving the equality constraint, the problem above becomes 



minimize (@ [C(pi), pj) 

Subject tO -y- + ^^^a,/3 

a=l 

/3=2 



12 2 



H 



> 0. 



(2.83) 



where we have made use of the fact that the direct sum of two matrices is a PSD matrix if 
and only if each matrix is PSD. Clearly, the resulting constraint is characteristic of a SDP 



in the inequality form with F = ls/2 and F ( 



a,f3 



m ® H: 



Hi 



H 



/3 1 



What about larger dimensional systems? Can we still write the optimization constraint 



in the form of a linear matrix inequality? Formally, the Horodecki condition fl2.81| ) allows 
the following expression for arbitrary dimension d: 

minimize (@ [C(pi), pj) 



subject to — 



l d2 © l d ®V(t d ) 

Ve{PnCP} 



^2 X a,(3 



a=l 

(3=2 



{HI ® flf ) © H% ® V {H{ 



Vd{PnCP} 



(2.84) 



> 



However, unless the direct sum can be restricted to just a few instances of PnCP maps (as 
the transposition, in the d=2 case), the problem above is rather useless due to its infinite 
sized matrix constraint. Alas, the existence of a finite set (possibly dependant on d) of PnCP 
maps that provides a sufficient separability constraint is still an open problem []48| , p. 29]. 

Nevertheless, for practical applications in larger dimensional systems, it is usually a good 
first step to consider problem (|2.84|) with the direct sum restricted to the transposition map 



minimize (St [C(pi), pj) 

cf 

II OJ9 

subject to 



A 2d 2 

d 



£ 

a=l 

(3=2 



X a ,(3 



m ® h 



m © h 



/3 



> 0. 



(2.85) 



Of course, the problem above is just a relaxation of problem (|2.77|) with Q et = B d ct and, as 
such, will generally yield a Choi matrix that is a (unnormalized) PPT-entangled state rather 
than the desired separable (£. Throughout, we will denote by B d et the set of CPTP maps 
whose Choi matrices satisfy the PPT condition, which will allow us to refer to the problem 
above as an instance of problem ( [2.77|) with C d 
Replacing the constraint C G 



'set 



B s d et with C 



Jsct 



G Bf l is not only useful because it gives 
a treatable matrix inequality, but also because of the simple observation that we do not 
always need to tightly bound B^ 1 in order to obtain a solution which is an element of it. 
This idea is particularly useful if, a posteriori, we can check whether the outcome of the 
relaxed optimization belongs to the set of interest. 

Deciding whether a given state is separable or PPT-entangled state can be remarkably 
easy or difficult, depending on the particular state under consideration. For example, if the 
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outcome Ct of the optimization ( 2.85 ) turns out to be of rank 2 or 3, then we can be sure 
of its separability, since there are no PPT-entangled states with such ranks |J7|, More 
generally, making this decision has been shown to be a NP-hard problem ||69|| . Yet, a number 
of algorithms performing efficiently in many non-trivial cases do exist |70, ITT], 172 



3 

Distance Measures 



In this chapter we present a detailed analysis of several distance measures for the space of 
density matrices. Apart from being a topic of independent interest, this analysis is relevant 
in the context of this thesis because it provides a repertoire of distance measures that - 
used in the quantum control problem we started to formulate in the previous chapter — yield 
optimization problems that are both physically meaningful and mathematically treatable. 

We divide our analysis between closeness measures (e.g., the fidelity), and distance mea- 
sures (e.g., the trace distance). For closeness measures, we focus on reviewing several useful 
properties of the Uhlmann-Jozsa fidelity, and propose an alternative definition of this quan- 
tity that features many attractive properties. 

A similar discussion is presented for metrics on density matrices. In particular, we 
consider the trace distance, and the metrics induced by the Hilbert-Schmidt norm and the 
spectral (or operator) norm, which have their merits assessed in the same basis as the fidelity- 
like measures. 

All of these quantities are put at work in the next chapter, where they appear to quantify 
the distance between sequences of density matrices. By minimizing distances or maximizing 
closeness, we will obtain optimal strategies for quantum control. 

3.1 Introduction 

From a mathematically rigorous viewpoint, a distance measure ^ on a set S is a function 
^:§x§^l such that for every a, b, c G 8 the following properties hold: 

(Ml) 0(a, b) > (Nonnegativity) , 

(M2) @(a, b) = iff a = b (Identity of Indiscernibles) , 

(M3) @(a, b) = @(b, a) (Symmetry) , 
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(M4) @(a, c) < @(a, b) + 0(6, c) (Triangle Inequality) . 

Any such function is called a metric 

In physics, though, it is common to talk about distance measures that are not metrics. 
For example, the quantum relative entropy [[F^, |74], [75[ defined on the set S = <S(Hd) of 
d-dimensional of density matrices as 

B(p,a) := Tr[p log p] - Tr [p logo"] , (3.1) 

is widely accepted as a distance measure between p and a, despite not satisfying (M3) nor 
(M4)[j. The Uhlmann-Jozsa fidelity 3 r (p, er) J7B|, [Tj]] - one of the most popular notions of 
distance between quantum states — is not technically a metric either: First, it behaves as an 
"inverted measure of distance" being maximal (1) when p = a and minimal (0) when p and 
a are orthogonal, in clear disagreement with (M2). Second, even the "inversion" £F' := 1 — J 
which complies with (M2) violates (M4), and is thus not a metric either. 

For several quantum information applications, fulfillment of (M1)-(M4) is not as relevant 
as a compelling operational interpretation for the "distance measure" at hand. In this 
respect, the quantum hypothesis testing problem serves as a justification for many standard 
notions of distance between quantum states. 

Consider that n copies of a quantum system are identically prepared in one of the states p 
or a. One is then asked to perform a measurement on the system in order to reject a certain 
hypothesis about its identity (the null hypothesis) and accept another one (the alternate 
hypothesis) 

Hq (Null hypothesis): The collective state of the system is p® n \ 

Hi (Alternate hypothesis): The collective state of the system is o~® n . 

If the measurement procedure suggests rejection of the null hypothesis despite it being 
correct, we talk about an error of the first kind, which occurs with probability a. On 
the other hand, the acceptance of the null hypothesis despite its incorrectness is called an 
error of the second kind, which occurs with probability (3. 

Depending on the number n of states available, on the adopted measurement strategy 
and on whether or not a and (3 are treated symmetrically, different quantities may arise to 
specify the error probabilities. Intuitively, these quantities form a meaningful "measure of 
closeness" between quantum states, since the "closer" p and a are, the more likely it is to 
confuse them. 

The relative entropy, for example, arises from the minimal error probability (3 when a is 
constrained to be smaller than a constant threshold e and n — > oo. In this case, the quantum 



analogue of Stein's lemma [[78|, [79| establishes that 

f3~exp[-nS(p,a)\, (3.2) 
hence characterizing S as a measure of distance. 



^dn fact, S(p, a) is not even well defined for every clement of § x § consider, for example, p any mixed 
state and a any pure state to see the second term of Eq. (fO) to go to —oo. 
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Likewise, if the error probabilities are treated symmetrically, i.e., a and (3 are simulta- 
neously minimized, then the optimal measurement strategy gives rise to the recently de- 
termined |80], [8l| quantum Chernoff bound £ := — logmin < s <i Tr(p s cr 1_s ), via the error 



probability asymptotic behavior 

a + (3 ~ exp [— n£(p, a)} . (3.3) 

In the case where a single copy of the states is available to generate the measurement 
statistics (n = 1), the minimal error probability (symmetrically treated) was shown long 
ago [^2]. m to be related to the trace distance D between p and a (defined in Sec. |3.2.1.6|) 



via 

a + f3 = \ -T)(p,a). (3.4) 

The fidelity between a mixed state p and a pure state a = 5F(p, a) := p\ip), 

also arises from the quantum hypothesis testing problem with n = 1 and measurement effects 
given by M = \ip)(ip\ and M\ = — A little thought shows that if we adopt the 

convention that a click of M suggests acceptance of p and a click of Mi suggests acceptance 
of a, then the probability of an error of the first type vanishes, i.e., a = 0. An error of the 
second type can occur, though, if M clicks when the actual state was a. This happens with 
probability 

/3 = W>,p), (3.5) 

Besides a satisfying physical interpretation, other physical and mathematical properties 
of distance measures are desirable for certain specific applications. In this chapter we eval- 
uate a number of distance measures against a list of such properties, namely, nonnegativity, 
identity of indiscernibles, symmetry, unitary invariance, convexity/concavity properties, mul- 
tiplicativity under tensor products, monotonicity under CPTP maps, relations with metrics 
and computability. 

Before outlining the structure of this chapter, let us mention two notational points plus 
a warning that will be pertinent for what follows. 

• The notation y/A denotes the unique PSD matrix such that y/A ^ y/A = A. Clearly, 
y/A can only exist if A is PSD and is obtained from the following procedure: Let U be 
the unitary matrix that diagonalizes A, i.e., U^AU = D, with D a diagonal matrix of 
non-negative elements. Then y/A = U y/~DU^ where y/D is the entry-wise square root 
of D. Note that there many matrices B (other than y/A) such that B^B = A. For 
example, the Cholesky decomposition guarantees that for every PSD matrix A, there 
exists an upper triangular matrix B such that B^B = A. However, B is obviously not 
hermitian, let alone PSD. 

• For an arbitrary matrix A, \A\ denotes the matrix yj A^A. If A is hermitian, then 
we have the identity y/A* = \A\, which mimics the standard equation y/x* = \x\ for 
real numbers x. Likewise, from the definition of the square root given above, we have 
\A\ > for every matrix A, just like |x| > for all real numbers x. However, recall 
that the statement of a matrix followed by the symbols > means that the matrix has 
only non- negative eigenvalues, i.e., it is positive semidefinite. 



34 



Distance Measures 



Be aware that the product of two hermitian matrices A and B is not hermitian unless 
[A, B] = 0. Similarly, the product of two positive semidefinite matrices A and B is not 
positive semidefinite unless [A, B] = 0. This implies, for example, that the equality 
V AB = \f~A\fB~ only makes sense for commuting PSD matrices A and B. 



Section |3.2| — which is an adapted reproduction of Ref. || - starts by describing how 
the Uhlmann-Jozsa fidelity behaves with respect to the several properties mentioned before, 
and subsequently introduces a new alternative definition of fidelity between mixed states that 
is thoroughly analyzed along the same lines. In section [O] we review some metrics arising 
from well known norms on the set of density matrices, namely the Hilbert-Schmidt norm, 
the spectral norm and the trace norm. Sec. |3.4| concludes the chapter with the introduction 
of two averaging schemes that generalize the notion of a distance measure between density 
matrices to a distance measure between sequences of density matrices. 



3.2 An alternative fidelity measure for quantum states 



The understanding of the set of density matrices as a Riemannian manifold |22| implies that 
a notion of distance can be assigned to any pair of quantum states. In quantum information 
science, for instance, distance measures between quantum states have proved to be useful re- 
sources in approaching a number of fundamental problems such as quantifying entanglement 



34, 85], the design of optimized strategies for quantum control H, M and quantum error 



correction [S7|, |S5|, ^ [50, EH], [S]. In addition, the concept of distinguishability between 
quantum states |93j can be made mathematically rigorous and physically insightful thanks 
to the close relationship between certain metrics for the space of density matrices and the 
error probability arising from various versions of the quantum hypothesis testing problem 
^3| . Distance measures are also regularly used in the laboratory to verify the quality of the 



produced quantum states. 

A widely used distance measure in the current literature (or more precisely, a "closeness" 
measure between two general density matrices), is the so-called Uhlmann-Jozsa fidelity, 3\ 
Historically, this measure had its origins in the 70 's through a set of works by Uhlmann and 
Alberti [75|, ^ PH], who studied the problem of generalizing the quantum mechanical 
transition probability to the broader context of *-algebras. The usage of the term fidelity 
to designate Uhlmann's transition probability formula is much more recent and initiated 
in the works of Schumacher |)7| and Jozsa |[77|| . Indeed, in an attempt to quantify the 



"closeness" between a certain mixed state p and a pure state \ip), Schumacher dubbed 
the transition probability (ijj\p\ijj) the fidelity between the two states. In parallel, Jozsa 
recognized Uhlmann's transition probability formula as a sensible extension of Schumacher's 
fidelity, where now the measure of "closeness" is related to a pair of mixed states p and 
a. Ever since, Uhlmann's transition probability formula has been widely accepted as the 
generalization of Schumacher's fidelity. 

The prevalence of this measure as one of the most used notions of distance in quantum 
information is not accidental, but largely supported on a number of required and desired 
properties for the role. For example, 3 satisfies all of Jozsa 's axioms, that is, besides recov- 
ering Schumacher's fidelity in the case where one of the states is pure, the following three 
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additional properties also hold: First, 3 equals unity if and only if it is applied to two iden- 
tical states; in other cases it lies between zero and one. Second, it is symmetric, i.e., the 
fidelity between p and a is the same as that between a and p. Third, it is invariant under 
any unitary transformation on the state space. Nevertheless, 3 is not the unique measure 
satisfying these properties. A prominent alternative which also complies with Jozsa's axioms 
and shares many other properties of 3, is given by the nonlogarithmic variety of the quantum 
Chernoff bound, Q, recently determined in Ref. f8Tfl . 

Despite fulfilling the properties listed above, both 3 and Q are, in general, unsatisfying 
measures from a practical computational viewpoint. Although 3 can be expressed in a 
closed form in terms of p and a, it involves successive computation of the square roots 
of Hermitian matrices, which often compromises its usage in analytical computations and 
numerical experiments, especially when the fidelity must be computed many times. Even 
more serious is the case of Q, which to date has only been defined variationally as the result of 
an optimization problem. The question that naturally arises is whether an easy-to-compute 
generalization of Schumacher's fidelity can be obtained. In the following, we provide a 
positive answer to this question and a thorough analysis of our proposed alternative fidelity, 

3 N- 



As we were finalizing Ref. ffl, we became aware of a very recent work of Miszczak et al. [3S 



in which 3^ was introduced as an upper bound to the Uhlmann-Jozsa fidelity. In many ways 
our analysis of 3^ is complimentary to that provided in Ref. J98[; results in common are 
noted in the corresponding sections of this thesis. 

The following sections are structured as follows. In order to provide a concrete ground 
for our proposal of 3tv as an alternative fidelity measure, we firstly revisit, in Sec. |3.2.1| , a set 
of basic properties of the Uhlmann-Jozsa fidelity. In Sec. |3.2.2j we formally introduce 3at and 
analyze it in the spirit of the properties reviewed in Sec. |3.2.1| . The computational efficiency 
of 3tv is contrasted with a number of previously known distance measures in Sec. |3.2.3j . 
We summarize our main results and discuss some possible avenues for future research in 
Sec. ET21L 



3.2.1 The Uhlmann-Jozsa Fidelity 

In this section, we will briefly survey some physically appealing features inherent to the 
Uhlmann-Jozsa fidelity 3. In Sec. |3.2.2j , these features will be used as a reference for char- 
acterizing the proposed new fidelity measure. 



3.2.1.1 Preliminaries 

The fidelity 3 was originally introduced as a transition probability between two generic 
quantum states p and a [ffCjl , 

3(p,a) := max . | Q,\tp) | 2 = fir J^pa^p) . (3.6) 

Here, and \tp) are restricted to be purifications of p and a, while the second equality 
indicates that the maximization procedure can be explicitly evaluated. At this stage, it is 
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worth noting that it is not uncommon to find being referred, instead, as the fidelity 
(e.g.,, Ref. 0). 



In Ref. [77 1, Jozsa conjectured that Eq. (|5.34|) was the unique expression that satisfies a 
number of natural properties expected for any generalized notion of fidelity^. Throughout, 
we shall refer to these as Jozsa 's axioms: 

1. normalization, i.e., 3"(p, a) G [0, 1] with the upper bound attained iff p = a (the identity 
of indiscernible property); 

2. symmetry under swapping of the two states, i.e., ^(p, a) = ^(cr, p); 

3. invariance under any unitary transformation U of the state space, i.e., $(JJ pU', UaW) = 
5F(p, a); and finally, 

4. consistency with Schumacher's fidelity when one of the states is pure, i.e., 

3 r (p,\il>)(rl>\) = (il>\p\rl>) (3.7) 

for arbitrary p and 

The proof that J satisfies all of Jozsa's axioms follows easily from the variational defi- 
nition of Eq. ( |5.34| ) (see, e.g., Ref. |21] for technical details). The remainder of this section 



discusses a number of less immediate properties of 3 r . 
3.2.1.2 Concavity Properties 

The concavity property of quantities like entropy, mutual information and fidelity is often of 



theoretical interest in the quantum information community [21|. In this regard, it is worth 
noting that a useful feature of J is its separate concavity in each of its arguments, i.e., for 
Pi,P2 > 0, pi +P2 = 1 and arbitrary density matrices p±, p2, 0\ and cr 2 , we have 

3" (PiPi + P2P2, o-i ) > Pi^Pi, 0\) + P29 r (P2, 0\) ■ (3.8) 



By symmetry, concavity in the second argument follows from Eq. ( |3.(j| ). Separate concavity 
can be proved |76| , 77| using the variational definition of 5F from Eq. ( |5.34|) . 



While it is known that is jointly concave [p4l 



V^(p!p! +P2p2,PlO-l +P2V2) > PlV3(pl,CTi) +p 2 V / ^(p2,Cr 2 ) , (3.9) 

it is also known that the fidelity 5" does not, in general, share the same enhanced concavity 
property^. 



2 Although, as mentioned before, this conjecture can be seen to be false with the counter-example of the 
nonlogarithmic variety of the quantum Chernoff bound Q, determined in Ref. [pTj . 



3 



Note that joint concavity implies separate concavity but not the other way around. For example, the 



separate concavity of \/J-(p,a) can be obtained from Eq. (3.9) by setting cr\ — U2 and using the fact that 
Pi +P2 = 1- 
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3.2.1.3 Multiplicativity under Tensor Product 

Another neat mathematical property of jF(p, a) is that it is multiplicative under tensor 
products: for any density matrices pi, p2, o\ and 02, 

5(pi ® P2, <?i ® o- 2 ) = tri)^, o" 2 ) • (3.10) 

This identity follows easily from the following facts: for any Hermitian matrices A\ and IB, 
(i) Tr(A\ <g> IB) = Tr(AV) Tr(IB) and (ii) t/K®B = y/K®VB. 

An immediate consequence of this result is that for two physical systems, described by p 
and a, a measure of their "closeness" given by £F remains unchanged even after appending 
each of them with an uncorrelated ancillary state r, i.e., jF(p <S> r, a <S> r) = 5F(p, cr). 



3.2.1.4 Monotonicity under Quantum Operations 

Given that J(p, cr) serves as a kind of measure for the proximity between two quantum 
states p and a, one might expect that any quantum operation £ should bring p and cr 
"closer together" according to 5F: 

?(£(p),£(cr))>?(p,a). (3.11) 

Indeed, it is now well-known that Eq. (|3.11| ) holds true |Kj for an arbitrary quantum op- 
eration described by a completely-positive-trace-preserving (CPTP) map £ : p 1— > £(p). 
Inequality ( |3.11| ) qualifies 7 as a monotonically increasing measure under CPTP maps and 
can be considered the quantum analogue of the classical information-processing inequality 
- which expresses that the amount of information should not increase via any information 
processing. 

On a related note, it is worth noting that any measure M which is (i) unitarily invariant, 
(ii) jointly concave (convex) and (iii) invariant under the addition of an ancillary system, is 
also monotonically increasing (decreasing) under CPTP mapsf]. Clearly, since satisfies all 
the above-mentioned conditions, Eq. ( |3.11| ) also follows by simply squaring the corresponding 
monotonicity inequality for y/j". 



3.2.1.5 Related Metrics 

The fidelity by itself is not a metric. However, one may well expect that a metric, which is 
a measure of distance, can be built up from a measure of "closeness" such as 3". Indeed, the 
functionals 

A[3 r (p, cr)] := arccos 

V^M, (3.12) 



B[5{p,a)] := J2-2y/?(p,a), (3.13) 



C[?(p,a)}:= y/l-?(p,a), (3.14) 
exhibit such metric properties (see Refs. [^l], |102| , |103| , |104 |105| , |106|1 and also Appendix |B.3| 



for more details). In particular, these functionals are now commonly known in the literature 
respectively, as the Bures angle the Bures distance [|103j , |104j1 , and the sine distance [ [106 



4 This follows easily from the Stincspring representation of a CPTP map and from the representation of 



the partial trace operation given in Refs. [100, 101 1 
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3.2.1.6 Trace Distance Bounds 

An important distance measure in quantum information is the metric induced by the trace 



norm || ■ || tr (defined in Sec. |3.3.1.1|) , which is commonly referred to as the trace distance |2T 

V{p,<r) = \\\p-a\\ tv . (3.15) 
The trace distance is an exceedingly successful distance measure: it is a metric (as is any 



distance induced by norms), unitarily invariant ||107||, jointly convex [21], decreases under 



CPTP maps [|108|] and, in the qubit case, is proportional to the Euclidean distance between 



the Bloch vectors in the Bloch ball. The trace distance is also closely related to the minimal 
probability of error on attempts to distinguish between two non-orthogonal quantum states 
For all of these reasons, one is generally interested to determine how other distance 



measures relate with the trace distance. 

The following functions of the fidelity were shown in Ref. [ |1(J9| | to provide tight bounds 
for DQ: 

1 - y/?(p, a) < V(p, a) < y/1 - W(p~, a) . (3.16) 

In fact, the stronger lower bound 1 — 5F < D holds if p and a have support on a common 
two-dimensional Hilbert space [|110|1 (e.g., any pair of qubit states), or if at least one of the 
states is pure ] [2~T|j . 

From these inequalities, one can conclude a type of qualitative equivalence between the 
fidelity J and the trace distance D: whenever J is small, T> is large and whenever 5F is large, 
D is small. 

3.2.2 An alternative fidelity measure 
3.2.2.1 Preliminaries 

We shall now turn attention to our proposed alternative fidelity measure between two quan- 
tum states p and a, namely, 

2 N (p, a) = Tr [pa] + - TrpVl - Tr a 2 . (3.17) 

This is simply a sum of the Hilbert-Schmidt inner product between p and a and the geometric 
mean between their linear entropies. It is worth noting that the same quantity — by the 
name super-fidelity — has been independently introduced in Ref. [0 as an upper bound for 

Remarkably, when applied to qubit states, 3"^ is precisely the same as 3\ This observation 
follows easily from the fact that for density matrices of dimension d — 2, it is valid to write 



3V (p, a) \ d=2 = Tr [per] + pV det a , (3.18) 



Both inequalities in Eq. ( 3 . 1 6| ) are saturated if p = cr, and also if p and a have orthogonal supports. A 



less trivial example of saturation of the upper bound on D is obtained when both p and a are pure states, 
whereas the lower bound on D can only be (non-trivially) saturated in Hilbert spaces of dimension strictly 



greater than 2 (sec Ref. [110 for an example with d = 3). Moreover, it is not difficult to show that the 



equality 1 — T = T> holds true if [p, a] — and at least one of the states is pure. 



3.2 An alternative fidelity measure for quantum states 



39 



which is just an alternative expression of 3 r for qubit states [104, 1 



When d > 2, however, 3^ no longer recovers 3", but can be seen as a simplified version 



of the fidelity measure 3c- proposed by Chen and collaborators ||112|| , which reads as: 



Sc{p, °)= l -^- + ^MP, *)> (3-19) 

where r = \/{d— 1), and d is the dimension of the state space of p and a. Moreover, it 
is straightforward to verify that while jFjv reduces to the Schumacher's fidelity [the rhs of 
Eq. ( |3.7D 1 when one of the states is pure; the same cannot be said for $c- 

It is not difficult to see from Eq. ( |3.17 ) that 3jv satisfies Jozsa's axioms 2, 3, and 4 as 



enumerated in Sec. |3.2.1.1| . The non-negativity of 3^ required by axiom 1 is also immediate 
from the definition. As a result, 3"^ is an acceptable generalization of Schumacher's fidelity 
according to Jozsa's axioms if: 

Proposition 1. $ N (p,cr) < 1 holds for arbitrary density matrices p and a, with saturation 
if and only if p = a. 

Proof. To begin with, recall that any d x d density matrix can be expanded in terms of 
an orthonormal basis of Hermitian matrices {Afc}^, 1 such that Tr(AjAj) = Sij (see, for 



example, Refs. [|113| , |114|| ). In particular, if we let A := (Ao, ■ ■ ■ , X^-i), then p and a admit 
the following decomposition: 

p = r ■ A and a = s • A , (3.20) 

where r and s are real vectors with d 2 entries (corresponding to the expansion coefficients 
which can be determined using the orthonormality condition). Since p and a are density 
matrices, r and s satisfy < r ■ s < 1 and r, s < 1 where r = ||r|| and s = ||s||. 

Using the expansion of Eq. (|3.20|) in Eq. fl3.17| ), we arrive at the following alternative 
expression of 3^, 

f N {r, s) = r ■ s + Vl - rVl - s 2 (3.21) 
= R S, (3.22) 

where, in the second line, we have defined two unit vectors in M d2+1 , explicitly, 

R := (r, Vl - r 2 ) and S := (s,y/l - s 2 ) • (3.23) 

The normalization of R and S then implies that 3 r Ar(p, a) = R ■ S < 1, with saturation if 
and only if R = S, or equivalently p = a. □ 

3.2.2.2 Concavity Properties 



As with vj, the new fidelity measure 3 r jy is jointly concave in its two arguments, i.e., for 
V11V2 > 0, pi + p 2 = 1 and arbitrary density matrices pi, p 2 , o~\ and a 2 , we have, 



3n{v\P\ +P2P2,Pio~i +V20-2) >Vi3n{pi,v\) + P2^n(P2, o- 2 ) 



(3.24) 
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Since J fails to be jointly concave in general, SFjv has stronger concavity property. Remark- 
ably, given the equivalence between 5" and SFjv in the d = 2 case, the result of this section 
implies that 5F is jointly concave when restricted to qubit states. 

The rest of this section concerns a proof of this concavity property of 3^. We start 
by proving the following lemma, which provides a useful alternative expression of inequal- 
ity O. 



Lemma 3.1. Define a function F : [0, 1] — > M. by 



F(x) := (r + xu) ■ (s + xv) + \J\ — \\r 



xu\ 



\s + xv\ 



(3.25) 



Given density matrices p 1; p 2 , (J\ and a 2 , there exist vectors r,s,u,v £ ~R d2 and x £ [0,1] 
such that the inequality 

F(x) > (l-x)F(O) +xF(l) (3.26) 



is 



equivalent to Eq. (|3.24|) . 



Proof. The proof is by construction. Using the parametrization of Eq. ( 3.20 ) for the density 
matrices in inequality ( |3.24| ), we obtain the following equivalent inequality for the vectors T*j 
and Sj\ 



/n (Pi»*i + p 2 r 2 , PiSx + p 2 s 2 ) > Pi/jv(r"i, «i) + P2fN(r 2 , s 2 ) 



(3.27) 



where the function /jy was defined in Eq. ( 3.21 



A straightforward computation shows that inequality ( |3.26| ) is identical to inequality 
(|3.27|) when we identify x = p 2 , 1 — x = pi, and set 



r 

s 



ri , 
si , 



u 

V 



r 2 

S2 



ri , 
si ■ 



(3.28) 



□ 



If F(x) has negative concavity in x £ [0, 1], then the inequality (|3.26j ) is automatically 
satisfied as it establishes that the straight line connecting the points (0, F(0)) and (1, F(l)) 
lies below the curve {(x, F(x))\x £ [0, 1]}. As a result, the joint concavity of £Fjv is proved 
with the following proposition: 



Proposition 2. For x £ [0,1], and r , s, u, v £ M^ 2 specified in Eq. ( 3. 28 ), the function 
F(x) [cf. Eq. ft3~2b] )J satisfies 

d 2 F(x) 



dx 2 



< 



(3.29) 



and hence 3^n is jointly concave. 



The proof of this Proposition is given in Appendix |B.l 
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3.2.2.3 Multiplicativity under Tensor Product 



In contrast with 3 r , the new fidelity measure jFjv is not multiplicative under tensor products. 
In fact, it is generally not even invariant under the addition of an uncorrelated ancilla 
prepared in the state r. In this case, jFjv between the resulting states reads as: 



JjvO ® t, a <8> r) = Tr [pa] Tr r 2 + y 7 ! - Tr p 2 Tr rVl - Tr a 2 Tr r 2 , 



(3.30) 



where the lhs equals 3 f N(p, a) iff Tr r 2 = 1, or in other words, iff r is a pure state. More 
generally, it can be shown that £Fjv is super-multiplicative, i.e., 



9at(Pi ® P2, 0"i <8> cr 2 ) > 9V(pi, o r i)9 r Af(P2, cr 2 ). 



(3.31) 



A proof of this property is given in Appendix |B.2| ; a similar proof was independently obtained 
in Ref. |B1. 



3.2.2.4 Monotonicity under Quantum Operations 

That SFjv is only super-multiplicative may be a first sign that it may not behave monotonically 
under CPTP maps. In fact, as we shall see below, Ozawa's counter-example ||115|| to the 



claimed monotonicity of the Hilbert-Schmidt distance [IT5 can also be used to show that 
3tv does not behave monotonically under CPTP maps. 

Let p and a be two two-qubit density matrices, written in the product basis as 



/ 1 \ 

10 



\ / 



and a 



( \ 



10 

\ 1 / 



(3.32) 



and consider the (trace preserving) quantum operations of tracing over the first or the second 
qubit. A straightforward computation shows that if the first qubit is traced over, then 



^(Tri p, Tri cr) = 1 > \ = 3 N (p, a) 



(3.33) 



which satisfies the desired monotonicity property. However, if instead the second subsystem 
is discarded, we find 

^(Tr 2 p, Tr 2 a) = < \ = ? N {p, a) . (3.34) 

Together, Eqs. ( |3.33|) and ( |3.34j ) show that jFjv is neither monotonically increasing nor 
decreasing under general CPTP maps. 

A natural question that follows is whether 3^ features a weaker form of monotonicity. For 
example, do arbitrary projective measurements — with the measurement outcomes forgotten 
- give rise to higher value of 3^ for the resulting pair of states? An affirmative answer 
would follow from a proof of the inequality 



(3.35) 
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for any complete set of orthonormal projectors Pj, and for arbitrary density matrices p and 
a. 

It is a simple exercise to prove Eq. (|3.35|) for the particular case where either of the 



commutation rules [Pj, p] = or [Pj, ex] = is observed for all values of i. Whether the same 
conclusion can be drawn from the more general, non-commutative cases remains to be seen. 
In this regard, we note that a preliminary numerical search favors the validity of Eq. ( |3.35| ). 

3.2.2.5 Related Metrics 



In parallel to the metrics A[3], B[3] and C[&\ introduced in Sec. |3.2.1.5| , we define 



A[3 r N (p, cr)] := arccos \/Wn{p, cr), (3.36) 



B[? N {p, a)] := ^2 - 2^? N (p,a), (3.37) 
C[$ N {p, cr)] := y/l-? N (p,a), (3.38) 

and prove that while C[3V| preserves the metric properties, both and P[3V] do not 

always obey the triangle inequality 

X[2 N (p, a)] < X[? N (p, r)] + X[? N (r, cr)] , (3.39) 

where X here refers to either A, B or C. For example, consider the qutrit density matrices, 
P = Is/3, 

' 0.90 0.04 0.03 
0.04 0.05 0.02 I . (3.40) 
0.03 0.02 0.05 





r 1 









-1 











and t = I 




^ 





) 





Table 3.1: A numerical test of the triangle inequality for ApV], Pt^Ar] and Cf^]. 



X 




X[? N (p,T)}+X[? N {T,a)} 


A 


0.9553 


0.9241 


B 


0.9194 


0.9137 


C 


0.8165 


0.8828 



Numerical computation of the quantities appearing in the triangle inequality gives rise to 
Table ft.l| . Note that for X = A,B, the first column dominates the second, i.e., the triangle 
inequality is violated and therefore neither A[jFtv] nor P[9V] are metrics. For X = C, no 
violation is observed for the above density matrices. Next, we prove that this is the case for 
any three density matrices p, a and r, thus C[5Fjv] is a metric. 

Proposition 3. The quantity C[$n(p, &)] is a metric for the space of density matrices. 



To prove this proposition, we will make use of the following theorem due to Schoen- 
berg ||117|1 (see also [|118| , Ch. 3, Proposition 3.2]). We state here an abbreviated form of the 
theorem sufficient for our present purposes. 
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Theorem 3.1 (Schoenberg). Let X be a nonempty set and K : X x X — > R a function such 
that K(x,y) = K(y,x) and K(x,y) > with saturation iff x = y, for all x,y G X. If the 
implication 

n n 

Ci = => K ( x i> x j)°i c j < ( 3 - 41 ) 

i=l i,j=^ 



holds for all n > 2, {x\, . . . , x n } C X and {ci, . . . , c n } C W, then V K is a metric. 

We make a small digression at this point to remark that, in spite of its successful appli- 
cation in the grounds of classical probability distance measures |[L19| , |120| , |121|| , Schoenberg's 



theorem has received almost no attention by the quantum information community. In this 
thesis, besides proving the metric properties of (7(3^], we will also make use Schoenberg's 
theorem to provide independent proofs of the metric properties of B[1(p,a)}, C[3 f (p, a)] 
(Appendix |B.3| ), and of the squared Hilbert-Schmidt distance, to be introduced later (Ap- 
pendix p~4"D . 



Proof of Proposition^. Clearly, from the definition of C 2 [3 r N(p,a)\, it is easy to see that 
it inherits from SFjv(p, er) the property of being symmetric in its two arguments, and that 
C 2 [9 r Ar(p, a)] > with saturation iff p = a. So, to apply Theorem |3TT| , we just have to 
show that for any set of density matrices {pj}™ = i { n > 2) and real numbers {cj}f =1 such that 
Yli=i c i = 0' ^ 1S ^ rue ^at 



(3.42) 



This follows straightforwardly by exploiting the zero-sum property of the (real) coefficients 
Cj and the linearity of the trace, 



Tr 



which concludes the proof. 



n , 

n 

E' 



C{Cj 



E 



Tr pi 



,i=i 



(3.43) 



□ 



We note that a proof of the metric property of \/2C[9 r N(p, a)] - by the name modified 
Bures distance — was independently provided by Ref. ||98|| . The proof provided above is 
significantly shorter thanks to the power of Schoenberg's theorem. 



3.2.2.6 Trace Distance Bounds 



In Sec. |3.2.1.6| , we have seen that a kind of qualitative equivalence between D and 3 can be 
established through the bounds on D given by functions of 3 r , c.f. Eq. (|3.16|) . Here, we will 
provide similar bounds on D in terms of functions of jFjv- 
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Proposition 4. For any two density matrices p and a of dimension d, the trace distance 
D(p,o~) satisfies the following upper bound: 



T>(p,a)<^-^l-? N (p,a), (3.44) 

where t := rank(p — a). Moreover, this upper bound on T> can be saturated with states of the 
form 

E/diagjAj W Udmg[P(A d )]W 
H Tr{diag[A,]} Tr {diag [A d ]} 1 ; 

where U is an arbitrary unitary matrix of dimension d, is an ordered list of d elements 
taking values in the set {Ai, A2} fAi, A2 > 0, but not simultaneously zero) and -P(A^) is the 
list formed by some permutation of the elements in A^. 

Proof. Note that the product of square roots in the expression of £Fjv, Eq. ( |3.17|) , is the 
geometric mean between the linear entropies of p and a . It then follows from the inequality 
of arithmetic and geometric means that 

1 TV 1 TV 

— P - + > x/l-TrpVl-Tra 2 , (3.46) 

which can be reexpressed as the following inequality after summation of Tr [pa] to both sides, 

IIP - crllns < ^2 [l-5 N (p,a)]. (3.47) 

Here, ||A"|| H s := V^ 1 \ X]X \ is the Hilbert-Schmidt norm (also known as Frobenius norm), 
defined for an arbitrary matrix X. The Hilbert-Schmidt norm and the trace norm ||X||t r : = 
Tr \J X^X are related according toQ 

||X|| tr < v^HXHhs, (3.48) 
where j; := rankX. Used in Eq. (|3.47|) , the above inequality leads to the desired result 



D(p, a) = \\\p- a\\ tT < yjlyfl - ? N (p, a) . (3.49) 
To prove that the states in Eq. ( |3.45| ) saturate this bound, we first note that because 



those states are isospectral, their linear entropies are identical and hence inequality ( 3.46 ) 
is saturated. To prove saturation of inequality ( |3.48| ), simply use Eq. ( |3.45| ) to compute 



\p - a\\ tI = Tr V(P - -) 2 = Tr r [ A d ; ag ^ ]r (3-50) 



||P-^||hs = \/ Tr [(p- a ) 2 ] = FTTT ; ( 3 - 51 ) 

V Tr{diag[A rf J} 
from which the identity \\p — cr|| tr = v^IIp ~~ °"||hs is immediate. □ 
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(a) For d — 3, a gap can be clearly noticed between 
the distribution of states and the absolute upper 
bound, i.e., the rhs of inequality ( [3.44 ) with r = d. 
Such a gap occurs whenever d is odd. 
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(b) For d = 6, no gap is observed between the 
bulk of randomly generated states and the absolute 
upper bound. In fact, this bound can be saturated 



by density matrices of the form given by Eq. (3.45) 
whenever d is even. 



Figure 3.1: (Color online) Plot of the trace distance D(p,a) vs 1 — &n(p, a ) f° r 4 x 10 6 pairs 
of randomly generated p, a with d = 3 and d = 6. The darker (blue) points are generated using 
pairs of mixed states whereas the lighter (green) points are generated using at least one pure state. 
The anti-diagonal solid line is the conjectured lower bound whereas the upper bounds given by 
Eq. ( |3.44| ) are represented by the dashed curves (cyan) — one for each integer value of r G [2,d]. 



How good are these upper bounds? With some thought, it is not difficult to conclude 
that the states arising from Eq. (|3.45|) can only have even t, and are thus unable to saturate 
the upper bound of Eq. ( ft .44]) for odd r. Nonetheless, from our numerical studies, it seems 
like the absolute upper bound — corresponding to the choice r = d in the rhs of Eq. (|3.44j) 
- is actually unachievable by any states if d is odd. An illustration of this peculiarity can 
be seen in Fig. ft.l(a)| , where the upper bound corresponding to r = 3 is well separated from 
the region attainable by physical states. In contrast, for every even d, the states given by 
Eq. (|3.45|) do trace out a tight boundary for the region attainable with physical states, as 
shown in Fig. |3.i(b)| for d = 6. 



On the other hand, it can also be seen from Fig. |3.1| that no points occur in the region 
where D < 1 — jFjy Indeed, intensive numerical studies for d = 3, 4, . . . , 50 have not revealed 
a single density matrix which contributed to a point in this region. This suggests that the 
following lower bound on D, in terms of 3^, may well be established^ 

Conjecture 3.2.1. The trace distance D(p, a) and the fidelity ^(p, a) between two quantum 
states p and a satisfy 

T>(p,o-)>l-? N (p,a). (3.52) 



6 To see that, assume, for simplicity, that AT is a square matrix of dimension d and let A <E M. d be the 
vector with entries Ai > A2 > . . . > Ad corresponding to the singular values of A. In addition, let v £ M. d be 
the vector with the first y = rank A entries equal to 1 and the remaining d — y entries equal to 0. Then, it 



follows that ||A||t r = |A • v\, ||A||hs = ||A|| and ^/y = ||u||. In this framework, inequality (3.48) is equivalent 
to Cauchy-Schwarz inequality applied to A and v, i.e., |A • v\ < [|A|| 



7 After the publication of Rcf. |l|, this conjecture was proved by Puchala and Miszczak in Ref. [ 1 22 1 
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In relation to this, it is also worth noting that the following (weaker) lower bound can 
readily be established via a recent result given in Ref. [B3]: 

Proposition 5. The trace distance T>(p,a) and the fidelity Jn(p, o) between two quantum 
states p and o satisfy the following inequality. 



D(p,<r)>l 



(3.53) 



Proof. This lower bound on D follows immediately from the lower bound on T) given in 
inequality fl3.16|) and the inequality 3 < $n recently established in Ref. |^| . □ 

As with the fidelity we can thus infer that whenever jFjv is large enough, T) is close 
to zero and whenever 9^ is close to zero, T> is close to unity. However — as should be clear 
from Fig. 3.1(b) 



the converse implication is not necessarily true. 
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Figure 3.2: (Color online) Plots of the average computation time for the fidelity functions $ 
(0)> (x), the nonlogarithmic variety of the quantum Chernoff bound Q (+), and the trace 
distance D (□) as a function of the dimension d of the state space. Computations were performed on 
a 2.6 GHz Intel Pentium 4 CPU. (a) The data is presented in a semilog plot where the smaller and 
larger markers correspond to timings from Matlab and C respectively, (b) By plotting ln(i) x ln((f) 
for 55 < d < 100 and timings from the C code, we obtain straight lines whose angular coefficients 
(m) quantify the "practical complexity" for computing each considered measure. 



3.2.3 Computational Efficiency 

For two general density matrices p and a, analytical evaluation of the fidelity &(p, o~) can be 
a formidable task. This is in sharp contrast with 9^0°; a ) which involves only products and 
traces of density matrices. Even at the numerical level — due to the complication involved 
in evaluating the square root of a Hermitian matrix — the computation of $(p, o~) can be 
rather resource consuming. For a quantitative understanding of the computational efficiency, 



3.2 An alternative fidelity measure for quantum states 



47 



we have performed a numerical comparison of the time required to calculate the fidelities 
5F and 3"^, the trace distance T), and the nonlogarithmic variety of the quantum Chernoff 
bound Q := min < s <i Tr(p s cr 1_s ). We have implemented the computations in both Matlab 
and C; we present the Matlab codes for reasons of accessibility and succinctness, while the 
C codes provide more accurate timings without the overhead of the Matlab interpreter. 

The time required to evaluate each function was estimated by averaging the times for 
100 pairs of randomly generated <i-dimensional density matrices^]. Results are shown in 
Fig. p. 2(a)| as a function of d. The Matlab codes are presented in Appendix |B.5| we attempted 



to make these codes as efficient as possible within the constraints of the Matlab environment. 
Corresponding C codes were implemented as Matlab MEX-files for convenience and can be 
found online ||124|| . Our C implementation directly calls the LAPACK and BLAS libraries 



included in the Matlab distribution for eigenvalue decompositions and matrix operations. 
The minimization required in the computation of Q was performed using the Brent minimizer 
from the GNU Scientific Library [125]. 



The results shown in Fig. |3.2(b)| display some consistency with the expected algorithmic 
complexity. From the figure, one sees that our C codes for computing 3" and Q require ap- 
proximately 0(d 2,r ) operations for values of d e [55, 100]. This is in good agreement with the 
theoretical asymptotic performance, since 3" and Q require two Hermitian diagonalizations, 
taking an expected 0(d 3 ) operations each ||126|| . Computing Q is slowest since it requires 
both sets of eigenvectors, while 3" requires only eigenvalues from one of the diagonalizations. 

Next fastest is the computation of D, which requires only eigenvalues from a single 
diagonalization. In this case, Fig. [3.2(b)| suggests that only 0(d 23 ) operations are required 



to compute the trace distance between density matrices of dimension d E [55,100]. Note, 
however, that this is significantly less than the expected 0(d 3 ) operations, typical from 
computations involving matrix diagonalization. Such a discrepancy can be understood as 
follows: since D can be computed considerably faster than Q or 3", the time taken by 
other spurious machine processes (not intrinsically related to the execution of our algorithm) 
becomes relatively important, compromising the accuracy of our timings. This is particularly 
significant in timing 3"^, our fastest-to-compute measure. In principle, one should expect an 
asymptotic performance 0(d 2 ), since 3^ requires only three Hilbert-Schmidt inner products. 
However, Fig. |3.2(b)| suggests that 0(d 26 ) operations are required. 



Nevertheless, Fig. |3.2| clearly shows that the practical numerical evaluation of 3^ is 
dramatically faster than the evaluation of 3", D or Q. This raises the prospect of using 3jv 



as a numerically efficient estimate of distance measures such as 3" [98] and D — particularly 



for small d where the bounds proven in Sec. [3.2.2.6| are tighter. As the dimension increases, 



the computational advantage of using 3^ becomes even greater, but the quality of the 
estimate drops. 



8 Here, we follow the algorithm presented in Rcf. [123] to generate d-dimcnsional quantum states. In 
particular, the eigenvalues {Aj}f =1 of the quantum states were chosen from a uniform distribution on the 
d-simplcx defined by J2i Ai = 1. 
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3.2.4 Concluding Remarks 

In the previous sections, we have proposed an alternative fidelity measure, 3jv, between an 
arbitrary pair of mixed quantum states. This new measure, together with the prevailing 
fidelity 3" and the nonlogarithmic variety of the quantum Chernoff bound Q |81[] are, to the 
best of our knowledge, the only known distance measures between density matrices that 
comply with Jozsa's axioms [[77] . That is, 5F, Q, and 3jv are the only known measures that 



generalize to pairs of mixed states the concept of fidelity introduced by Schumacher between 
a pure and a mixed state |97| . 



The simplicity of JFjv is in sharp contrast with 3 and Q since it involves only products 
of density matrices. Numerically, this leads to significant reduction in computation time for 
(p, a) over 5F(/?, a), especially for higher dimensional systems. 

Besides being easier to compute, 3^ has also been shown to preserve (and even enhance) 
a number of the useful properties of 3" and Q. For example, we have shown that 3^ is a 
jointly concave measure, that it can be used to place upper and lower bounds on the value 
of the trace distance and that it gives rise to a new metric for the space of density matrices. 
A remarkable consequence of the joint concavity of 3^ is that 3" is also jointly concave when 
restricted to a pair of qubit states — an interesting problem which remained unsolved thus 



far [127, 128 



The new measure, nevertheless, is not without its drawbacks. To begin with — 3"^, unlike 
measures such as 3" or Q — does not behave monotonically under completely-positive-trace- 
preserving (CPTP) maps. In addition, it does not necessarily vanish when applied to any 
pair of mixed states which are otherwise recognized to be completely different according to 
3", Q or their trace distance T>. In fact, the explicit dependence on the linear entropies of p 
and a gives rise to the following undesirable feature: the value of 3^ between two completely 
mixed states living in disjoint subspaces can get arbitrarily close to unity as the dimension 
of the state space tends to infinity. 

The undesirable features of 3jv provide a clue as to when 3"^ may not be the preferred 
measure of "closeness" between two quantum states: We know that 3at does not measure 
the "closeness" between two high-dimensional, highly mixed states (i.e., states having non- 
negligible linear entropy) in the same way that measures like 3", Q or T) would. In these 
cases, the interpretation of 3"^ as a measure of proximity between quantum states must be 
carried out with extra caution. 

With this in mind, we nevertheless see 3jv as an attractive alternative to 3\ Even when 



out of its range of applicability, it follows from a very recent result of Miszczak et al. |98| that 
3jv provides an upper bound on the Uhlmann-Jozsa fidelity 3". Moreover, it seems promising 
that 3tv between any two quantum states may be measured directly in the laboratory, 
without resorting to any state tomography protocol |98[ . 



Let us now briefly mention some possibilities for future research that stem from the 
present work. To begin with, it would be interesting to search for a quantitative relationship 
between 3"^ and Q analogous to that between 3jv and D established here, or that between 
3tv and 3" given in Ref. [pq| . An estimate of Q based on some function of 3"^ would be 
useful given that a closed form for Q is not currently known, and that 3"^ can be computed 
relatively easily. In addition, assuming 3"^ as an alternative to 3", it seems reasonable to 
revisit some of the problems where 3^ has proven useful, but with 3"^ playing its role. In 
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particular, it would be interesting to investigate whether the simplicity associated with Ujv 
will offer some advantages over 3\ 

As a first example, we recall from Ref. that a standard measure for the amount of 
entanglement of a state p is given by the shortest distance from p to the set of separable 
density matrices. Given the relative simplicity of 3^ with respect to it is not inconceiv- 
able that a distance measure based on £Fjv (such as CfJJjv]) m &y lead to a more efficient 



determination of this quantity if compared, for example, to C[J] or the Bures distance |85 



Of course, any serious attempts in this direction should be preceded by further investigation 
of the impact of the nonmonotonicity of £Fjv under CPTP maps |84[ . 

As another example, 3^ can be used as a figure of merit in designing optimized quantum 
control and/or quantum error correction strategies: One is typically interested in determining 
a quantum operation C that minimizes the averaged distance between the elements of a 
sequence of noisy quantum states pi and a pre-defined sequence of target quantum states <7j. 
In this context, it would be interesting to investigate if distance measures based on 3 r jv would 
lead to any advantage in terms of computation time. Clearly, this has potential applications 
to the implementation of real time quantum technologies. 

Yet another possible direction of research consists of employing 3jv as a distance mea- 
sure between quantum operations — as opposed to quantum states — via the isomorphism 
between quantum states and CPTP maps |50, |5T[j. In this regard, it is worth investigating 



whether distance measures based on 3jv would satisfy the six criteria proposed in Ref. [ |105 
Remarkably, from the results of the present work and Ref. |98| , a few strengths of S^-based 
measures can already be anticipated. Of special significance are the fulfilment of the criteria 
"easy to calculate" and "easy to measure" . Along these lines, some operational meaning for 
Jtv would also be highly desirable. Although we do not presently have a compelling physical 
interpretation of £Fjv, it is not inconceivable that one can be found in an analogous way to 
3 r @j0|. 



3.3 Metrical distance measures 

In this section we study three distance measures on the set of the density matrices that 
are truly metrics. The fulfilment of the metric axioms is safeguarded by the fact that these 
distances are "induced by norms" . In order to make this notion clearer, let us start with the 
following definition: 

Definition 3.1. Given a vector space V over C, a function || • || : V — > R is called a norm 
if and only if, for every v, w G V and A G R 

(Nl) \\v\\ > (Nonnegativity) , 

(N2) || v || = iff v = 0, 

(N3) || Aw || = |A|||f|| (Positive Scalability), 

(N4) \\v + w\\ < \\v\\ + ||iw|| (Triangle Inequality) . 
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The quantity @(v,w) : = \\v — w\\ induces a notion of distance between the elements v 
and w which is, indeed, a metric "induced by the norm || ■ ||". The metric axioms on page [TI] 
can be easily verified: 

(Ml) (Nonnegativity): that @(v,w) > follows trivially from (Nl); 

(M2) (Identity of Indiscernibles) : that @(v, w) — iff v — w follows trivially that from (N2); 

(M3) (Symmetry): that @(v,w) = @(w,v), follows from \\v — w\\ — || — (w — v)\\ and (N3); 

(M4) (Triangle Inequality): that @(v, w) < 3>{y, u) +@(w, u) follows by replacing v — ► v — u 
and w — >• u — w in (N4), for all u E V. 



In Sec. |3.3. 1| we introduce three well established metrics for the space of density matrices: 



the trace norm, the Hilbert-Schmidt norm and the spectral norm. In Sec. |3.3.2j these mea- 



sures are evaluated against the same criteria discussed in the previous section. Table |37|, on 



page 56, summarizes the main results of this and the previous sections. 



3.3.1 Three metrics for the space of density matrices 

A standard family of norms for the algebra of matrices was introduced by Schatten in 



Ref. ||129|| (see also Ref. [107]). For any value of p E [1, oo], the Schatten p- norms are defined 

as 

\\A\\ P = [Tr (\A\ p )}p for pG[l,oo) and WA^ = ||A|| , (3.54) 



where A G Md 1 xd 2 an d \\A\\ is the standard operator norm of A [cf. Eq. ( |3.62| )]. 



In this section, we shall focus on the metrics induced by the Schatten p-norms with 
p = l,2,oo. In order to avoid notational confusion with the so-called Ky Fan /c-norms^|, 
we adopt the alternative nomenclature trace norm \\ ■ || tr , Hilbert-Schmidt norm || • ||hs an d 
spectral norm || • || for the Schatten 1-, 2- and oo-norms, respectively. 

3.3.1.1 The Trace Distance 

From Eq. ( 3.54|) , the trace norm of A is simply ||A|| tr := Tr V A^A. Although the trace 



distance — the metric induced by the trace norm — should then be given by \\A — B\\ tT , it 
is a common practice (adopted here) to define it as half of this number. We have already 
defined the trace distance between two density matrices in Eq. ( |3.15 ). Below, we exploit the 



above formula for the trace norm and the hermiticity of density matrices to write 

D(p, a) := \ \\p - a\\ tr = \ Tr [^(p - a) 2 ] . (3.55) 

A number of alternative definitions of the trace distance are also known. For example, 

V(p,a):= max | Tr [W{p - a)} | , (3.56) 

W&U(d) 



9 These are the sum of the k largest singular values of A. 
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where U (d) denotes the group of unitary matrices of dimension d. Remarkably, the maxi- 
mizing W satisfies W(p — a) = \p — a\ (see [77], Lemma 6] or | |129| , pp. 43-44] for a proof), 
so that we recover the definition of Eq. (|3.55|) . 

While restricted to the space of density matrices, the trace distance can also be defined 

D(p, a) := max Tr \P(p - a)] , (3.57) 

0<P<l rf 



or, alternatively, the optimization can be taken over all projectors P [21 , pp. 404-405]. 

As a final observation, we note that the trace distance is equal to half of the sum of the 
singular values of p — a; or what amounts to be the same in the case of hermitian matrices, 
half of the sum of the modulus of the eigenvalues. This follows easily from the singular value 
decomposition p — a = UHV\ where U and V are unitary matrices and E is a diagonal PSD 
matrix. Substituted into Eq. (|3.55|) , we get 



D(p,<7):=iTrE, (3.58) 

establishing the claimed result. This definition motivates the Matlab code shown in the 
Appendix [B.5| for the numerical computation of D. 



3.3.1.2 The Hilbert-Schmidt Distance 

Whenever an inner product is defined on a set, a norm can be immediately defined for each 
element of the set via the square-root of the inner product of that element with itself. In 
this framework, the Hilbert-Schmidt inner product of a matrix A with itself gives rise to the 



Hilbert-Schmidt norm ||A||hs := y Tr [AtAJ. Clearly, this is precisely the norm arising from 
Eq. ( gjjjD with p = 2. 



The Hilbert-Schmidt distance between two density matrices p and a is defined as 



X(p, a) :=\\p- a\\ m = ^Tr [(p - a) 2 } . (3.59) 

From Eq. ( |2.21|) , it then follows that the Hilbert-Schmidt distance is the Euclidean norm of 
vec (p — a), namely 

%{p, a) := \J [vec(p - a)^ vec(p - a) . (3.60) 

This implies, for example, that \\p — ct||hs is merely the sum of the absolute values squared of 
every entry of p — a. Moreover, Eq. ( p.60|) motivates the Matlab code shown in Appendix p. 5 



for the computation of the Hilbert-Schmidt distance. 

Yet another useful definition arises from the singular value decomposition p — o~ = ITEV' 



applied to Eq. (|3.59|) . A straightforward calculation shows that 

% 2 {p,a) := Tr (S 2 ) , (3.61) 



10 Note that for an invertible matrix p — a, is the unitary arising from the (unique) left polar decom- 
position of p — ct |^l|, Theorem 2.3]. For non-invertible p — a. W is not unique but it does exist (see js^, p. 
53] and references therein). 

11 See [130, Lemma 4] for a closely related definition of the trace distance for arbitrary hermitian matrices. 
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or in words, the squared Hilbert-Schmidt distance between p and a is the sum of the squared 
singular values of p — a. Due to the hermiticity of density matrices, we can make a further 
simplification and regard Ji 2 {p, ex) as the sum of the squared eigenvalues of p — o. 

It is interesting to note that although "K is the induced metric by the Hilbert-Schmidt 
norm, the function "K 2 can also be shown to be a metric. This is proved in the Appendix [B.4 . 



3.3.1.3 The Spectral Distance 

The dual norm Hi 3 HI of the trace norm is the so-called spectral norm (also known as operator 



norm, Schatten infinity norm, etc). It is defined for an arbitrary matrix A as 

Mil := max \\Av\\ (3.62) 
H=i" 

where the norms appearing on the right hand side refer to the Euclidean norm for vectors 
v G C d . 

From this variational definition, it is possible to show that the spectral norm of A is equal 
to the largest singular value of A, i.e., for the SVD A = UHV\ is the largest element 
of E. We can restate this in terms of the eigenvalues of the matrix A* A = VY?V^ . Clearly, 
each diagonal element of S 2 is an eigenvalue of A^A, so we can write 



\\A\\ := VAmax WA] = A max VIU = A max |A| , (3.63) 

where A max is an operator that extracts the largest eigenvalue of its argument. The spectral 
distance between two density matrices p and a is thus defined as 

0(p,<j) =A max |p-a| (3.64) 

Since p — a is a normal matrix (it is actually hermitian), simple diagonalization shows that 
the eigenvalues of \p — a\ are simply the modulus of the eigenvalues of p — a. This observation 
leads to the Matlab code given in Appendix p.5| for the computation of 0. 



3.3.2 Benchmarks of metrical distances 

In this section we present an analysis of the metrics introduced above that parallels the 



study of the properties of J and SFjv presented in Sec. 3.2, The reader will note, however 



one omission: we do not evaluate the metrics against the criterion "Consistency with Schu- 
macher's fidelity". Of course, given that metrics are measures of distance, they should not 
be expected to recover a measure of closeness in some special case. 



3.3.2.1 Jozsa's Axioms 

1. Normalization. The metric axioms (Ml) and (M2) guarantee that for any metric 
we have \\p — a\\ > with saturation iff p = a. Noticeably, this establishes a slightly 
different normalization axiom than that one satisfied by the fidelity-like quantities 
3"7\r and Q. While these saturate their upper bounds when the states are identical, the 
metrical quantities saturate their lower bounds in this case. Of course, this poses no 
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conceptual difficulties and is merely a manifestation of the fact that the fidelity-like 
functions are closeness measures, while metrics are authentic distance measures. 

A few comments regarding the saturation of the upper bounds are pertinent: From 
the inequalities ( |3.16| ), it is easy to prove that D(p, a) = 1 if and only if 5F(p, a) = 0, 



which implies that the trace distance upper bound is achieved with and only with a 
pair of orthogonal states. On the other hand, orthogonality does not suffice for the 
saturation of the upper bounds of % and 0. For example, consider the orthogonal 
mixed states p and a of Eq. ( |3.32| ). It is easy to compute that CK(p, a) = 1 < V2 and 
0(p, a) = 1/2 < 1, where the rhs of the inequalities indicate the actual upper bounds 
of % and 0. 

A little thought shows that the upper bound of the spectral distance is achieved if one 
of the states is pure and orthogonal to the other state (which is allowed to be mixed). 
The upper bound of 3~C(p, 0), in turn, is saturated if and only if J N (p,o~) = [this 



follows from the third inequality in Eq. ( |3.69|) 1, or equivalently, if and only if p and a 
are pure and orthogonal. 

2. Symmetry. By definition, every metric is symmetric [cf. axiom (M3) on page |3~H . 

3. Unitary Invariance. The three metrics studied here are unitarily invariant. This 
follows from the fact they can be solely expressed in terms of the singular values of 
p — a, as shown in Eqs. ( |3.58| ), (|3.61| ) and (|3.63| ). Since the singular values of a matrix 
are invariant under unitary transformations, so are these metrics. 



3.3.2.2 Convexity Properties 

Every induced metric is jointly concave. This is proved in the following via a straightforward 
application of the norm axioms triangle inequality (N4) and positive scalability (N3). 

For any non-negative scalars p\ and p 2 (no need to require p\ + P2 = 1) and density 
matrices p±, p 2 , o~\ and <t 2 we can write 

||(PiPi +V2P2) - (pi0"i + p 2 o"2) || = |bi(pi - 0"i) +P2(p2 - ct-OH ( 3 - 65 ) 

< \V\{P\ ~ + IMP2 - 02)11 (3.66) 
= - (Till +P2IIP2 - 02 1 1 (3.67) 

which establishes the desired property. Contrasted to the effort involved in the proof of joint 
concavity for 3^, the above proof reveals the value of the underlying structure of induced 
metrics. 



3.3.2.3 Multiplicativity under Tensor Product 

With numerical examples, it is straightforward to see that none of the three metrics is 
multiplicative. In fact, both the trace norm and the operator norm are known to be super- 
multiplicative under tensor product ||132 |, which implies the supermultiplicativity of T) and 
0. Although this also seems to be the case for "K, we have not been able to find or produce 
a proof of this fact. 
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3.3.2.4 Monotonicity under Quantum Operations 



In Ref. ||108|| , Ruskai proved that D monotonically decreases (contracts) under arbitrary 
CPTP maps. 

The monotonicity of "K has a somewhat longer history. In the quantum information 
literature, this property was recognized as desirable feature for entanglement quantification 
in Refs. [[84], ^5] . In Ref. ||116|| , a flawed proof of the contractivity of "K was given; the error 
was detected by Ozawa in Ref. [115|, who provided an example of a CPTP map and a pair 



of 4 x 4 density matrices for which !K was seen to increase^. 

In Ref. |133| , Perez-Garcia et al. showed that neither CK nor are generally contractive 
under arbitrary CPTP maps, but both of them are if restricted to the subset of unital CPTP 
maps^|. Moreover, in the case of qubit states, both !K and are contractive under arbitrary 
CPTP mapsQ, but this already fails to be true for "K in the case of qutrit states. 



3.3.2.5 Bounds 



We have already shown in Eq. ( 3.16|) how D and £F are related, and in Eqs. ( 13.441 ), (p. 52 ) and 
( |3.53|) how this relation can be modified to place bounds on T) via functions of 9^- Here, 
we prove the following inequalities between D, IK and 0: 



0(p,a) < M(p,cr) < 2T>(p,a) < y/x3i(p,o) < tO(p,a) 



(3.68) 



where r := rank(p — a). 

Apart from mutually relating the metrical distance measures, the sequence of inequalities 
above can be used in connection with one of the inequalities ( |3.16|) , (|3.47| ), ( [3.52j ) or ( |3.53|) 



to relate each of metrics T>, % and with 5" or 3^. 
show that 

2 



For example, it is straightforward to 
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(3.69) 



where the first inequalities in the third and fourth lines are conditioned on the validity of 
conjecture [3.2.1| , but guaranteed to hold if jFjv is replaced with y^v- 

We now prove the inequalities in Eq. ( 3.68|) which are, in fact, a simple restatement of 
well-known inequalities between the Schatten p-norms applied to the matrix p — a. The 
first inequality, for example, is the particular case A = p — a of the more general inequality 
1 1 A 1 1 < ||A||hs- To see that this holds, square and express each norm in terms of the singular 
values of A, <Ji(A), to get 

maxai(A) 2 < y^ai(A) 2 . (3.70) 



12 Rccall that with this same example we have shown in Sec. 3.2.2.4 that 5Fjv is not monotonically decreasing 
under arbitrary CPTP maps. 



13 A unital map U is characterized by having the identity matrix as a fixed point, i.e., W(ld) 
14 This fact had already been anticipated by Nielsen in Ref. [ 134 1 for the case of !K. 
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This is obviously true since the term in the left is only one of the many non-negative sum- 
mands in the right. 

Likewise, the second inequality follows from ||v4||hs < ll^lltr- This can be proved by 
squaring and using the singular value expressions to obtain 



< 



(3.71) 



which clearly holds since the right hand side is a summation of non-negative terms involving 
every element appearing in the sum on the left hand side. 

The third inequality follows from ||A|| tr < A/rank(A)||A||Hs ; which has already been 
stated and proved in this thesis [cf. Eq. ( 3.48|) and the footnote on page 55]. 

Finally, the fourth inequality follows from ||A||hs < \/rank(A)||A||. Once again, this can 
be proved by squaring and expressing the norms in terms of the singular values, 



E(Ti(i) 2 < rank(A) maxci-j(A) 
7 



(3.72) 



which is trivially true since rank(A) equals the number of non-zero singular values of A. 



3.3.2.6 Computational Efficiency 



As discussed in Sec. |3.2.3| , the algorithmic complexity in the computation of the trace distance 
is 0(d 3 ) due to the need of diagonalization of a matrix. However, this is considerably more 
efficient than other measures such as 3" or Q since it only requires the computation of the 
eigenvalues (without eigenvectors). Clearly, the same applies for 0. 

In contrast, the definition of CK from Eq. (|3.60|) enables its computation with 0(d 2 ) 
operations. "K is thus the most efficiently computable metric among the ones considered 
here. 



3.4 Distances between sequences of density matrices 

So far in this chapter, we have studied a number of distance (and closeness) measures be- 
tween density matrices. In this section we aim to generalize to a pair of sequences of density 
matrices, the notions of distance introduced before to a pair of density matrices. The moti- 
vation for this is the construction of objective functions {&) for problem ( 2.77|) , introduced 



in the previous chapter as a general formulation of the problem of transforming between 
sequences of density matrices. 

For any choice of measure Ql G {D, CK, 3, Jn}, we introduce two averaging schemes 
(3 l ) l and {^) 2 i each of which providing a different quantitative estimate of the distance 
between two sequences of / > 2 d-dimensional density matrices [pi] i=1 and [cj] i=1 . These are 



Table 3.2: A summary of the properties of the studied distance measures. In the column entitled 'Monotonic under maps', we 
present the largest considered family of maps under which each measure is monotonic. The hierarchy of considered families is: 
CPTP maps > unital CPTP maps > projective measurements. The gray color indicates a conjecture supported by numerical 
evidence. In the column entitled 'Related metrics', the dashes indicate that the corresponding quantities are already metrics. 
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Pp. 54,54 
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defined as follows: 

(@(p i ,<r i )) 1 :='52iri®\Pi,<ri], (3-73) 



i=i 



{@{Ph°i)), 



i=l i=l 



(3.74) 



Here, 7Tj is a chosen probability distribution over the alphabet % = 1, . . .,1 and satisfying 
7Tj G (0, 1) and J2i=i n i = 1- The averaging scheme is simply a weighted average distance 
between the z-th elements of each sequence, while (St} 2 measures @ between a single pair 
of /d-dimensional density matrices formed by the weighted average — with respect to the 
direct sum — over the elements of each sequence. 

By suitably choosing the values of 7Tj in Eqs. ( |3.73| ) and ( |3.74 ) and running the optimiza- 



tion Q2.77D , one is actually setting a hierarchy on the desired accuracy of the implementation 
of each "atomic transformation" pi i— > 'p i . For example, if there is no precedence of an 
atomic transformation over the others, then the uniform probability distribution 7Tj = 1/1 
for every i should be chosen. On the other extreme, if 7Tj is chosen to be 1 for some value 
of i, then the corresponding atomic transformation will be the only one that matters; in 



these circumstances, the special case of single-state transformation discussed in Sec. [2.4.2 
is recovered. Since the single-state case has already been fully solved, we assume without 
loss of generality that 7Tj ^ 1 for any i. We can also assume 7Tj 7^ for every i, which is 
justified as follows: if certain atomic transformations are absolutely irrevelant, then instead 
of assigning zero weight we can simply remove the corresponding source and target states 
from the sequences. Henceforth we shall refer to the point probabilities 7Tj as priorities. 

The averaging schemes introduced above are interesting because they yield distance mea- 
sures (@) for sequences of density matrices which behave much in the same way as 3) behaves 
for density matrices. This is made more precise in the following: 

Theorem 3.2. The functions {3{pi,ai)) l2 defined in Eqs. ( |3.73j ) and ( |3.74| ) are metrics for 
the space of sequences of density matrices if 3 is a metric for the space of density matrices. 



Proof. The proof is a trivial verification of each one of the metric axioms stated on page [31]: 



(Ml) (Nonnegativity): That (2t(pi,o~i)) 1 > for all sequences [pi] i=1 and [erj] i=1 follows from 
S'iPi, o~i) > for % = 1, . . . , I and from the fact that a convex sum of non- negative terms 
is nonnegative. The nonnegativity of {@(pi,o~i)) 2 is an instance of the nonnegativity 
of @ with block- diagonal density matrices. 

(M2) (Identity of Indiscernibles) : We first prove the 'if direction' for both and {S>) 2 '- If 



2 



Ni=i = Mi=i> we nave (®(pi, = (®(pi, pi))i = J2i = and also (®(pi, °*)) 

(@(pi, Pi)} 2 = & (©j 7TiP«, ©j TTj^j) = 0. Remarkably, notice that the first equality in 
each case would not hold if we were dealing with sets (as opposed to sequences) of 
density matrices. 

Conversely, the condition (@(p i ,a i )) 1 = Yli 7r i^(Pi) a i) = requires Ql^p^Oi) = 
because 7Tj 7^ for i = Due to the metric property of @, this can only 
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happen if p^ = Oi for all i, or equivalently if [pi] i=1 = [o"t] i= i- Likewise, the condition 
{@(pi, o-j)) 2 = ^(0^ iTiPi, i vrjCj) = requires i ^p; = j 7r;(7i which is equivalent 
to pi = Oi since VTj 7^ for all %. 

(M3) (Symmetry): Symmetry of (^) 12 follows trivially from the symmetry of 

(M4) (Triangle Inequality): For three sequences of density matrices [p]f =1 , [c]f = i and [t][ =1 , 
the multiplication of each one of the valid triangle inequalities @(pi,Ti) < 2>{pi,ai) + 
^(o'i, Ti) by %i yields another set of I valid inequalities. Summing over all of them, the 
triangle inequality for is established. The triangle inequality for (@) 2 is just an 
instance of the triangle inequality for @ with block-diagonal density matrices. 

□ 

Apart from the metric axioms, many other properties of Q) are inherited by {S>) 12 . For 

example, for any sequences [pP]f =1 , [p\ ]f=i, [<j\ ]f=i> [crj- 2 ' l ]| =1 , and pi and p 2 non-negative 
numbers such that pi +P2 = 1, the inequalities 

(®{vx P f +P2P?\pM l) + p 2 <rf ) )> 1>2 < Pl (SipPM^v+P* (@(P?\°h)i,2 , (3-75) 
(or the reversed inequality) follow straightforwardly from the joint convexity (concavity) of 

Likewise, if [Qi]( =1 = [£(pi)]f=i and [q]f =1 = [£ (q)]f = i, then 

(^(ft,ft)>i |2 < {®(pi,<n))i,2 , ( 3 - 76 ) 

(or the reversed inequality) holds if @ is monotonically decreasing (increasing) under the 
map S. In particular, if S is a unitary map, saturation of inequality ( |3.76| ) can be shown to 
hold if @ is unitarily invariant. 

Due to the good properties of the distance measures T>, "K, 0, 3 and 3^n, and the 
parallelism between {&) 12 and 2 1 , we consider the measures (D) 12 , (CK) 12 , (0) 12 , (9 r )i2 
and (9 r Ar) 12 sensible choices for quantifying distance between sequences of density matrices. 
In the next chapter, they are used as the objective function of problem ( |2.77|) and some of 
the resulting problems are formulated as semidefmite programs. 



4 

Assembling Semidefinite Programs for 

Quantum Control 



4.1 Introduction 

At the end of Ch. 0, we proposed the general optimization 



min (®[C{pi),p t ]) , (4.1) 



cec d 



as a formal expression for the problem of determining a quantum operation converting be- 
tween sequences of density matrices. In the same chapter, we saw how this problem can 
be brought very close to the form of a SDP if Cj et is chosen to be either the set of CPTP 
maps, Q| et , or a relaxed version of the set of EBTP maps, £>^ ct . In this chapter, the formu- 
lation of problem ( |4.1| ) over these sets is completed with the specification of using the 
distance measures introduced in Ch. 0. Our main results are the derivation of several SDPs 
formalizing our quantum control problem. 

Once we have our SDPs assembled, we are in the position to numerically solve them and 
observe how different choices of distance measures influence the resulting optimal controllers. 
This analysis is first conducted in a qualitative basis, and restricts to some examples of 
qubit state transformations. Later on, a more general setting is considered, and preliminary 
numerical results lead to a quantitative description of the "compatibility" between a chosen 
distance measures and the remaining ones. 



This chapter is divided as follows: In Sec. 4.2 we derive the SDPs to minimize metrical 



distances related to the trace distance, D, the Hilbert-Schmidt distance 'K and the spectral 



distance 0. In Sec. 4.3 we discuss the optimization problems arising from the closeness 



measures 3 and 3^, and a SDP is derived in a particular case. Qualitative and quantitative 



comparisons between the numerical solutions of these problems are presented in Sec. (O 
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4.2 Minimizing distances 



In this section we consider the formulation of problem ( [4.1]) with {&) taken as one of the 
metrics[] (D) 12 , ("K 2 ) 1 , (3"0 2 j (0)2- Exploiting a number of algebraic tricks, we demonstrate 
how the minimization of these quantities can be cast as SDPs. 

As a first trick that will be useful for all the choices listed above, we start by reexpressing 
problem ( |4.1| ) in the equivalent epigraph form [20] 



minimize t 
subject to (2>) <t 
C e Cf , 



(4.2) 



Albeit a new variable t is introduced here, the gain is that the objective function becomes 
linear in the problem variable. Since we have already seen in Sec. |2.4.3j how to handle the 
constraint C G Q et for the cases of interest [i.e., C| et = Q^ et and Q et = £>| et ], it only remains 
to reexpress the new inequality constraint < t in the form of a linear matrix inequality. 
This is done next, case by case, for each choice of (2>). 



4.2.1 The Trace Distance 

In principle, we should start by establishing which averaging scheme, or (T>) 2 , is to be 
considered first. However, this is dispensable in the case of the trace distance since 



(<D[c{ Pi ),n)x = {nc{Pi),n) 



2 • 



(4.3) 



for all density matrices C(pi) and p i . This "degeneracy" follows from the identity Tr Ai = 
Tr 0^ Ai, valid for every set of square matrices {Ai}. In particular, if Ai is taken to be the 
diagonal matrix of the eigenvalues of 7Tj[C(pi) — pj, then we obtain Eq. ([4.3[). Because of this 
equivalence between (D) 1 and (D) 2 , henceforth we shall adopt the simplified notation (D), 
and write 



(D[C( ft ),ft]> 



i=l 



Pi 



(4.4) 



We now focus on the inequality constraint of Eq. ( |4.2| ) which, thanks to the equation 
above, is of the form ||A|| tr < t. The following lemma is then immediately applicable: 



Lemma 4.1 (Fazel-Hindi-Boyd, ||135|| ). For any square matrix A and t G 
and only if there exists matrices Y and Z such that 



< t if 



Y A 
At Z 



> and Tr Y + Tr Z < 2t , 



(4.5) 



1 Note the omission of the metrics a nd (0) 1 from the roll of metrics considered here. Unfortunately, 
we have not been able to cast problem ( [4.1[ ) as a SDP for these choices of (S>). 
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Proof. Throughout, U and V are defined via the SVD of A, i.e., A = UHV^ . We start by 
assuming that there exists matrices Y and Z such that conditions ( f4.5| ) hold and then we 
show that ||A|| tr := TrS < t. 

Recall that the trace of the product of two PSD matrices is always non-negative, so 



Tr 



Y A 

At Z 



> 



(4.6) 



for every matrix X. In particular, take — (W —V^ ) and expand Eq. fl4.6|) . Exploiting 
the cyclic property of the trace and the SVD of A one finds 2 Tr £ < Tr Y + Tr Z, which 
implies Tr £ < t. 

Conversely, suppose TrS < t and let Y = UTJJ^ and Z = VT,V'. Clearly, TtY + Tt Z = 
2 Tr S < 2t. Also note that 



Y A 
At Z 



which concludes the proof. 



(^)s(t/t vt)>o 



(4.7) 



□ 



In the light of lemma |4.1| , we obtain the following optimization problem for the mini- 
mization of (S))'- 



minimize \ (Tr Y + Tr Z) 



( 



subject to 



Y 



f [CiPt 



f 

Z 



\ 



Pi\ 



i=l 



> 



(4.8) 



i=l 

TSC 



C e C5 ct 



where we have redeemed from the epigraph form by eliminating the variable t, in such a way 
that the objective function becomes a linear function of the matrix variables Y and Z. 

The optimization problem above can already be recognized as a SDP. To make this more 
explicit (and to provide a useful form for its numerical implementation), we now reexpress 
problem (|4.8|) in the inequality form. This can be done by noting the following: 



1. Due to the positivity requirement, Y and Z can be restricted to the set of hermitian 
matrices of dimension Id. As such, they can be expanded in the bases introduced in 



Sec. 12.2.3.51, 



(Id) 2 (Id) 2 

Y=J2 y* H fd and Z = z a H? d . 



(4.9) 



Q = l 



Such a choice, reduces the objective function | (TtY + Tr Z) to the form Id(yi + zi)/2. 

2. The constraint C G Q et , reformulated in terms of the expansion coefficients x^ v of the 
Choi matrix £ in the basis {H^ <g> H^}^ u=1 , was shown in Sec. |2.4.3.1| to be equivalent 
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to 



1 



(4.10) 



u=2 



in the case of Q et = Q d et , (i.e., when the optimization runs over the set of CPTP 



maps); and in Sec. [2.4.3.21 , to be equivalent to 

l2d 2 



^=2 



(4.11) 



in the case of C| et = j6 d et , (i.e., when the optimization runs over a certain subset of the 
set of CPTP maps which is, in general, a superset of the set of EBTP maps). 



3. Combining Eqs. Q2.16Q and Q2.41 ), the density matrix C(pi) can also be expressed in 



terms of the coefficents x 



d 2 



c ^) = ^f + E^ Tr (^d)^d- 



(4.12) 



/i=i 

u=2 



Using these three facts, problem ( |4.8[ ) can be rewritten as 

Id, . 
minimize — {y± + z\) 

d 2 (Id) 2 

subject to Fp + y^y^ F(^^, y a , z a ) > 

/x=l a=l 
u=2 



(4.13) 



where F(x^ u , y a , z a ) is a linear function of the variables cc^, y a , and z a , as explicitly shown 
below 

/ 



1 
1 

l 
l 


i=l 




\ 


■ + 

l 

i=l 1 

i 

























(4.14) 
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and Fq is a constant matrix given by 
/ 



0/d 


©f(¥-*) 

i=l v 7 




0/ d 



!M2 

d 



(4.15) 



^2 

v — 

where the highlighted blocks are only considered if C| et = i3 d ct . 
4.2.2 The Hilbert-Schmidt Distance 

As already mentioned, we have been unable to write the minimization of the metric 
as a SDP. Essentially, the difficulty arises in dealing with the square-root in the definition 
of this quantity. Indeed, the minimization of the squared version {'K 2 ) 1 — which is also a 
metric (cf. Theorem !T5 and Appendix |B.4j ) — can be cast as a SDP. This is demonstrated 
in Sec. [4.2.2.1 . 



In Sec. |4.2.2.2| , we show how the minimization of { < K 2 ) 2 can also be written as a SDP. 
Since (W 2 ) 2 and (!K) 2 are monotonically related via (CK) 2 = a/ (3~C 2 ) 2 , the same SDP will 
also provide the quantum operation minimizing (IK) 2 . 



Finally, in Sec. 4.2.2.3 , we prove that if every atomic transformation is to be implemented 
with the same priority, i.e. 7Tj = 1/1 for % — 1, . . . , I, then it is irrelevant whether we minimize 
(J{ 2 ) 1 or (3~C) 2 , since both problems lead to the same optimal quantum operation. 

4.2.2.1 Minimizing (J{ 2 ) 1 

From the definition ( |3.60| ) of !K, we can write ( < K 2 ) 1 as 

(^[CG^pJh = J> {vec [C( Pi ) - - Pi ]}Uec[C{ Pi ) -p t ] 



= v\ n v c , 

where we have defined the diagonal matrix II := ©f =1 7Tjld 2 and the vector 



(4.16) 



( vec [C(pi) - \ 
vec [C(p 2 ) - p 2 ] 

\ vec - Pi] ) 



(4.17) 
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With these provisions, the inequality constraint of Eq. ( |4.2|) assumes the form 

t-vj,Uv c >0, (4.18) 
and can be reformulated as a linear matrix inequality with the aid of the following lemma: 

Lemma 4.2 (Schur complement condition for positive semidefiniteness 0, 136|| ). Let A be 

a Hermitian matrix partitioned as 

A <t £)• (4 - l9) 

in which An is square and nonsingular. The Schur complement of A with respect to An is 
defined as Sch(/l|Aii) := A22 — A\ 2 A^ A-12, and we have A > if and only if An > and 
Sch(A|A n ) > 0. 

Because the inequality ( |4.18| ) is precisely Sch(M(C, t), LI" 1 ) > for a matrix M(C,t) 
defined as 

vc 



M(C,t) 



ir 1 



(4.20) 



and because II 1 > 0, Eq. ( |4.18| ) can be reexpressed as M(C, t) > and we arrive at the 
following optimization problem: 



minimize t 

subject to 

V L 

C e Cf 1 



n- 1 v c 

4 t ri0 



(4.21; 



Just as done in the last section, by adopting the Choi matrix representation of C and 
expanding it on a tensor product basis, we obtain a SDP in the inequality form: 



minimize t 

d 2 

subject to F + x n,vFn,v + tT > 



(4.22) 



u=2 



where T = (0/ d 2©l©0 d 2) if the optimization is taken over the set Q d ct , or T = (0/ d 2©l©0 2 d 2 ) 
if optimizing over the set *B d et . The matrices F^ v and F are shown below: 



/ 



Tr ( P JH%) u v 


\ 


0/ d 2 1 : 




Tr (p]H%) u v 




Tr(pjH%)ui ■■■ Tr(p]H£)ul\ 






HI ® HI 






H»®H? ) 



(4.23) 
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Fo- 



ri 



- \it 



[vec(^-^)] 



vec ft -ft) 
vec ft -ft) 



V 



1,-. 



d 2 



/ 



(4.24) 



where it^ is shorthand notation for vec if^ and the highlighted blocks only occur if the 
optimization is taken over i3^ ct . 



4.2.2.2 Minimizing (IK) 2 or (IK 2 ) 2 

As argued before, the monotonicity between (0~C 2 ) 2 an d (3~0 2 guarantees that the quantum 
operation C minimizing CK 2 [C(p i ),p i }) 2 also minimizes CK[C(p i ),p i }} 2 . In what follows, we 
derive a SDP for the minimization of ('K 2 [C(pi),p i ]) 2 . 

Once again, we employ the definition of % from Eq. ( |3.60| ) to write CK 2 [C(p i ),p i }) 2 = 
W(,wc, where Wq is given by 



w c := vec (J)^ [C(pi 



(4.25) 



i=l 



Using wj,wc in the inequality constraint of problem (|4.2| ), we find t — wj,wc > 0, which, 
by the application of Lemma [4.2|, leads to the following equivalent optimization problem: 



minimize t 



subject to 



I 



(/d) 2 
t 



C e 



w 



Cset 
d 



t 



> 



(4.26) 



Reexpressing this problem in terms of the suitable expansions of the Choi matrix of C, 
we arrive at a SDP of the same form given in Eq. ( 4.22| ), but with different values of T, Fp V 
and Fq. In this case, T = (Orr^a © 1 © d 2) if the optimization is taken over the set Qf 1 , 

and T = (0(/d) 2 © 1 © 2 d 2 ) if the feasible set is chosen to be £> d et . The matrices F^ u and Fq, 
in turn, are defined below according to the convention that the highlighted blocks are to be 
considered only if the optimization is taken over £> d ct : 
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/ 



0(/d) 2 


7 

vec vr, Tr m) fl- 

i=l 




r i it 

L »=i 











# d ® # d 











\ 



(4.27) 



1 



(Id)3 



© 



vecH- 

i=l 



Id 

i=l 



V 



\ 



/ 



(4.28) 



While the minimal t satisfying the constraint of Eq. ( |4.22 ) gives the optimal value of (!K 2 ) 2 , 
its square root gives the optimal value of (!K) 2 . 



4.2.2.3 Monotonicity between {'K 2 ) l and {K), 



In Sec. 3.3.1.2 , we saw that the squared Hilbert-Schmidt distance between two density ma- 
trices p and a can be expressed as the sum of the eigenvalues of p — a squared, that is, 
Tr (A 2 ) for A the diagonal matrix of elements given by the eigenvalues of p — a . As a result, 
we can write 

i 

(^ 2 [C(p i ),pJ) 1 = ^^Tr(A 2 ) , (4.29) 



8=1 



for Aj the diagonal matrix of eigenvalues of C(pi) — p i . Likewise, a simple computation gives 



W(A),PJ>; 



E*?it(A?: 



(4.30) 
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Due to the different exponents of 7Tj in each equation, the relationship between ( < K 2 ) 1 and 
(9C) 2 is not, in general, monotonic. However, it is easy to see that monotonicity takes place 
if every 7Tj is equally chosen to be 1/7. In this case, the priorities can be factored out of 
the sum to give x/I in the first case and y/x/I in the second, where x := Yli=i ^ Y C^l)- As 
a result, ( < K 2 [C(pi),p i }) 1 and ( < K[C(pi),p i }) 2 are minimized with the same C in the case of 
uniform priorities. 

Notice that this is a practically useful observation. As a quick glance at Eqs. (|4.23|) 
and (|4.27 ) shows, the dimension of the matrix inequality constraint arising from the opti- 
mization of ('K 2 ) 1 scales linearly with /, while in the case of (IK) 2 the scaling is quadratic. 
It then follows that the minimizer of (Ji) 2 can be obtained with a quadratically smaller 
computational cost if 7Tj = 1/1 for all i. 



4.2.3 The Spectral Distance 

For (®) = (0) 2 , the inequality constraint of problem 
spectral norm, as follows: 



P^\ 



8=1 



< t. 



can be written in terms of the 



(4.31) 



A well-known reformulation of this type of inequality is given by the following lemma: 

Lemma 4.3. For any d- dimensional matrix A and scalar t > 0, \\A\\ < t if and only if 
A^A < t 2 t d 

Proof. First assume ||A|| := a/ A max (Af A) < t, then X mSLX (A^A) < t 2 which implies that the 
diagonal form of A^A satisfies V^A^AV < t 2 ld- This is trivially equivalent to A^A < t 2 ld- 
Conversely, assume A*A - t 2 t d < and let Umax be a normalized eigenvector of A^A such 
that AUu max = A max (AU)u max =:||A|| 2 i; max . Then ^ ax (AfA - t 2 l d ) v m!lx = \\A\\ 2 - t 2 = 
(\\A\\ + t)(\\A\\ - t) < 0, which implies \\A\\ - t<0. □ 

So, the minimization ( |4.2| ) becomes 

minimize t 



subject to tl Id - < ^ 7Tj [C(pi 



Pi 



t- 1 \ (B*t[C{pi 



Pi\ 



> 



(4.32) 



i=l 



i=l 



C e Q c 



which, after recognition of the Schur complement (cf. Lemma (4.2[ ) on the lhs of the inequality 
constraint, is equivalent to 

minimize t 



( 



subject to 



tt 



id 



[C(Pi 



\ 



Pi 



i=l 



07T i [C(p i ) -p^ 
\ i=l 

C e C s d et . 



tt 



id 



> o 



(4.33) 
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Following the same protocol from previous sections, namely, reexpressing the map C in 
terms of a suitable expansion of its Choi matrix, the problem is reduced to the inequality 
form given in Eq. (ggg) with T = l 2Id © d 2 for C d = Q| et and T = l 2/d © 2d 2 for C d 



Furthermore, the matrices F^ v and F are as follows: 



/ 



fJ,,V 



0/ d 


i 
i 

i 7 

! vr, Tr (pjfl-fl flj 

1 i=i 
i 






J 

07r,Tr(p^)^ 

j=i 


+ ■ 

i 

0/ d 

1 
1 










H» 9 HI 










H» © Hf 



\ 



(4.34) 




^2 



(4.35) 



where the highlighted blocks are considered only in the case Cf 



set 



K?SCt 

°d ■ 



4.3 Maximizing Closeness 

In this section we continue to derive SDP expressions for the control problem of interest, 
but here {3) is taken to be a measure of closeness between sequences of density matrices, as 
opposed to the metrics considered in the previous section. In particular, we shall discuss the 
maximization of the fidelity-like quantities (3 r ) 1 2 and (3^)1 21 introduced in the last chapter. 

To account for this "inversion" on how distances are measured, the general optimization 
problem ( |4.1|) must have the minimization replaced with a maximization, 



max (0[C( Pi ),Pi]> 
Cec d 



(4.36) 
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The concavity properties of (3 r ) 1 2 and (3^)1 2 ( a l° n g with the convexity of the constraint C G 
Q et ), ensure that for any choice of (@) G {(3 r ) 1 2 , (3V)i 2} we obtain a convex optimization. 
However, we have been unsuccessful in providing a SDP formulation of these optimization 
problems in the general case of arbitrary priorities and mixed states. For this reason, in 
what follows we restrict to the maximization of the functions (9 r Hs) 12 ) where 

?miC(p t ),n :=Tr[C( Pi ),pJ , (4-37) 

is the Hilbert-Schmidt inner product between the density matrices C(p«) and p t . 

The motivation for this function as a replacement for £F and 5Fjv is as follows: If the 
target sequence is exclusively composed by pure states, then we have already seen that 
both 5F and SFjv recover the Schumacher fidelity — which is, in fact, $ns- This implies that 

(3WC(Pi), = P[C(Pi)> Fi>]>i = \M)v where A = ft)<^| for all z, and 

thus the SDPs we shall obtain actually maximize any of the three quantities aboveQ. 

The situation is not as well justified when source and target sequences are mixed states. 
This can be anticipated by an evaluation of ^Jfis against the criteria of the last chapter 
(cf. Table |3.2| ). In its favor, iFfis has the properties of symmetry, unitary invariance, compli- 
ance with Schumacher's fidelity, separate linearity, multiplicativity under tensor product and 
computational complexity 0(d 2 ). However, it fails to be monotonic even under projective 
measurements and we have not been able to determine a related metric — in particular, 
none of A[5Fhs]> -BPhs] or C[5F HS ] are metrics for the space of density matrices. Even more 
serious is the fact that, in general, S^usip, c) does not achieve its maximal value when p = a. 

In spite of this, the maximization of (^hs)! 2 ^ n ^ ne case °f mrx ed states is motivated as 



follows: From the inequalities 3hs < $ < (see Refs. ]77] and ]9B] for a proof of the first 
and the second inequalities, respectively), one can easily show that 



(3ks)i l2 < (3V 2 < (^)i 



2 • 



(4.38) 



which establishes the maximum value of (^hs)^ as a l° wer bound for both (3 r ) 12 an d 
(9 r Af)i 2 - m practice, if one is only interested in a control action that guarantees a minimal 
performance (measured in terms of (3 r ) 1 2 or ($n)i 2); then the maximization of (J^hs)! 2 
provide such an operation if its optimal value is larger than the required performance. 



4.3.1 The Hilbert-Schmidt inner product 

We start noting the following similarity between the expressions of (^hs)! an d (3 r Hs)2 : 

(3 r Hs[C(p i ),p i ]) 1 = $>,Tr [C{pi)p % ] , (4.39) 



i=l 



(3 r us[C(p i ),p i }) 2 = Tr 



i=l 



.3=1 



J> 2 Tr[C(p ; 



i)Pi\ 



(4.40) 



i=l 



2 Note, however, that we cannot generally write (9 r Hs[C(/Oi), |V , i)]) 2 = (3[C(pj), | 7 / , i)]) 2 = 
(5jv[C(/9j), |V'i)]) 2 - The direct sum taken over the pure target states multiplied by 7Tj, effectively turn them 
into mixed states of a larger dimensional Hilbert space. In this case, the equivalence between 3hSi 3" an d 
Sat is generally invalid. 
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Since the only difference between the two expressions is the exponent of the priorities 7Tj, 
in the rest of this section we restrict to assemble a SDP for the objective function (^hs)]/ 
Obviously, a SDP for (Stis^ can be readily obtained from our SDP for (^hsK by simply 
replacing every occurrence of 7Tj with 7if . 

In addition, we note that just as {'K 2 ) 1 and (3~C) 2 are monotonic with respect to each 
other in the case of uniform priorities (cf. Sec. [4.2.2.3| ), we have that (^hs)! an d (^hs^ 
are proportional to each other in the same circumstances. In fact, it is easy to see from 
Eqs. an d (EH> th at (^hs)i = I (^hs) 2 if 7r» = 1/7 for i = 1, . . . , J. As a result, the 

operation maximizing (^hs)! also maximizes (^hs^ i n this case. 

For what follows, it will be useful to have two different presentations of Eq. (|4.39|) . In 
the first, we write the trace as 



Tr[C( Pi )A] = Tr[e:(p7®A 



(4.41) 



where we used Eq. ( |2.16|) and the definition of the partial trace. In the second, we substitute 
£ with the expansion of Eq. (|2.41| ) to get 



1 d2 

Tr [C{ Pi )p % ] = t + E x w Tr WpJ) Tr ( H M 



(4.42) 



Next, the formulation of SDPs over Q d ct and £> d ct is presented in an independent fashion. 



4.3.1.1 Semidefinite program for {3>) = (?hs)i and C G Q d 



set 



With the above provisions, expressing problem ( |4.36| ) as a SDP is immediate for {S>) = 
(^hsK an d C G <2 d et : The objective function is assembled from Eqs. ( [4.39|) and (|4.41|) , 
whereas the feasible set comes from the constraints of problem ( |2.79| ) to give 



maximize Tr 



,i=i 



subject to £ > 

Tr [t (Hg <g> l d )] = d5 aA for a = 1, . . . , d 2 

This is clearly a SDP in the standard form [cf. Eq. ( |2.63| )], with 
1 



Id and b c 



dS, 



a. I 



for a — 1, 



i=l 



(4.43) 



(4.44) 



4.3.1.2 Semidefinite program for = (^hsK and C G <6f 



set 



In this case, the optimization problem ( |4.36| ) is just problem ( 4.43|) with the extra constraint 
£ T2 > 0, as explained in Sec. |2.4.3.2| . Due to this addition, it turns out to be easier to derive 
a SDP in the inequality form, as follows. 
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From the expansion of Eq. Q2.41 ) for £. (and using the normalization of density matrices 
and that = Id), we find that the objective function of problem ( |4.36p can be written 
as h Si=i n i + Sm =1 x m v&u, v ( we have deliberately left the first sum unevaluated in order 

v=2 

to obtain a SDP for the maximization of (9 : hs)2 v i & the replacement 7Tj — > 7r 2 , as explained 
before), where we have defined 



tt, Tr (tf^J) Tr (tffo) . (4.45) 



Using the objective function above and the inequality constraint of Eq. ( |2.85|) , the opti- 
mization problem of interest reduces to the following SDP in the inequality form: 

1 i d 2 

- ^ n i ~ minimize x^ v (-o^) (4.46) 

i=l fj,=l 
u=2 

d 2 

subject to F + x w F w ^ ( 4 - 47 ) 

u=2 

where we have included the minus signs to reexpress the original maximization as a mini- 
mization. Furthermore, we have defined 



and = (h2 ® H^j © (# d Q ® . (4.48) 



Finally, we note that by removing the second d 2 -dimensional block from Fq and F^ v , we 
obtain an inequality form for the SDP of Eq. (|4.43|) (maximization of (^hs)! over the set of 
CPTP maps). 



4.4 Controller sensitivity to the choice of (@) 

In the previous sections we derived a number of SDPs for the problem of optimally transform- 
ing between sequences of density matrices. This multiplicity of optimization problems arises 
from the many available choices of distance measures between sequences of density matri- 
ces. In this section, these problems are numerically solved and by comparing their solutions 
we attempt to provide estimates on how different are the optimal operations (controllers) 
resulting from each problem. 

From a practical viewpoint, there is at least one good reason for the proposed analysis: 
as we will see next, some of the SDPs derived here are harder to solve than others. It is thus 
interesting to find how well the solution of an easy problem approximates the solution of a 
difficult one. 

The celebrated efficiency in solving a SDP is a consequence of the fact that the interior- 
point algorithm p0| , |66| requires only a polynomial number of operations (with respect to 
the "problem size") to find an optimal solution. More specifically, if n is the number of 
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variables of the SDP in the inequality form and m is the dimension of the matrix inequality 
constraint, then the number of necessary operations to find a solution is not larger than 
0(m 2 n 2 y/n) [18[ |66l. However, if m and/or n are large, this can be a formidable task. 

The values of m and n for the SDPs derived in the preceding section are shown in 
Table |4.1| , and a practical estimate of the time required for their solution over the set Q| et is 



presented in Fig. |4.1| for a few values of / and d. A quick glance at Fig. |4.1| and/or Table |4~1 



reveals a clear computational advantage of (Sris)^ over the metrics (T>), (!K } v (!K) 2 and 
(0) 2 . Both the table and the figure are consistent in that no change of the problem size 
occurs with a variation of I. Moreover, for a fixed value of d, (^hs)! 2 gi ves rise to the SDPs 
with the smallest values of n and m.f\ 

Table ^J] also shows that the minimization of (?C) 2 is the only one that yields a quadratic 



scaling of the dimension m of the matrix constraint with I. This is born out in Fig. |0 
where the minimization of (3~C) 2 is seen to be dramatically slower than the optimization 
of the other measures. Typically, the second slower minimization is that of (D). This is 



justified in Table 3~T, where the number n of variables involved in the minimization of (D) 
is seen to scale quadratically with J, whereas the other measures do not show any scaling of 
n with I. 

Finally, although the minimizations of both ('K 2 ) 1 and (0) 2 yield SDPs with precisely 
the same number of variables (and matrices whose dimension scale linearly with I and 
quadratically with d), the actual dimension of the matrices is larger for {'H 2 ) 1 . Once again, 
this is substantiated in Fig. |4.1| , where the minimization of {'K 2 ) l is seen to be always slower 
than that of (0) 2 . 

In the next sections, these measures are compared not from the view point of com- 
putational cost, but in terms of how different are the transformations produced by the 
optimization of each of them. 



Table 4.1: A comparative analysis of the "size" of each SDP formulated in Sees. |4.2j and |4~3 
for different objective functions (@) and feasible sets Q et . The scaling on the number n of 
variables and the dimension m of the matrix constraints is shown as a function of / and d. 
For ease of comparison, the values for / = d = 2 are shown in brackets. 





QT 


B s d ct 


(V) 

(W) a 
(0)2 


n 


m 


n 


m 


d 2 (d 2 + 2/ 2 -l), (44) 
d 2 (d 2 -l) + l, (13) 
d 2 (d 2 -l) + l, (13) 
d 2 (d 2 -l) + l, (13) 

d 2 , (4) 

d 2 , (4) 


2/d + d 2 , (12) 
d 2 (l + /) + l, (13) 
d 2 (l + / 2 ) + l, (21) 
2/d + d 2 , (12) 
d 2 , (4) 
d 2 , (4) 


d 2 (d 2 + 2/ 2 -l), (44) 
d 2 (d 2 -l) + l, (13) 
d 2 (d 2 -l) + l, (13) 
d 2 (d 2 -l) + l, (13) 
d 2 (d 2 - 1), (12) 
d 2 (d 2 - 1), (12) 


27d + 2d 2 , (16) 

d 2 (2 + /) + l, (17) 
d 2 (2 + / 2 ) + l, (25) 
2/d + 2d 2 , (16) 

2d 2 , (8) 

2d 2 , (8) 



3 It should be noted, however, that the SDP formulations of the last section are not guaranteed to be the 
best ones in each case — it is possible that the optimization of the metrics can be formulated as smaller 
SDPs. In particular, if symmetries are introduced in the problem (e.g., by restricting to source and target 
states symmetrically distributed in the Hilbert space), then symmetry reduction techniques [137] can be of 
assistance. These ideas will be put at work in the next chapter. 
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Figure 4.1: (Color online) Average time required to solve the SDPs from Sees. and 4.3 



over 



the set of CPTP maps. First, a sequence of I source and target d-dimensional density matrices is 



randomly generated. These sequences are then input to SeDuMi [135] and the time taken to find 
the optimal solution for each function {&) is recorded. The procedure is repeated for 100 different 
sequences with a fixed value of (d,I). Each point in the plots represents the mean time for a 
given pair (d, /), while the error bars represent the standard deviation associated to the averaging 
process. 



4.4.1 Qualitative analysis: The Bloch disk photo gallery 

In this section we present a sequence of plots representing the numerical solution of each one 
of the SDPs formulated in the preceding sections. Our aim is to provide a first qualitative 
analysis of how severely an optimal operation depends on the choice of distance/closeness 
measure being optimized. 

In the present analysis, we restrict to the case of qubit states d = 2. This is done with 
the intent of visualizing the output states of each optimization problem as vectors on a 
three dimensional unit ball — the Bloch ball. Furthermore, we consider source and target 
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sequences of only two states each (J = 2), in such a way that we can restrict to a plane 
within the Bloch ball — the Bloch disk. 

The specific type of transformation we look at is the purification of a pair of mixed qubit 
states. The source states are taken to be separated by a Bloch sphere angle = 90° and 
have lengths i?i,i?2 < 1- The purification task consists of an attempt to increase these 
lengths up to unit (pure states), while preserving the angle between them. We note that 
due to the choice of pure targets, the maximization of (?hs)i conducted here is equivalent 
to the maximization of (3 r ) 1 or (&n)i, as discussed in Sec. |i~3| . 

In the following plots we show the Bloch vectors of source and target states, and also 
those of the states effectively optimizing each distance measure over the feasible sets Q| et 
and £>| ct . For sake of comparison, we present separate sets of plots for situations where (i) 
Ri = i?2 and 7Ti = 7T2, (ii) R\ ^ R2 and iri = t\2 and (iii) R\ = R2 and tt\ 7^ 112- 



4.4.1.1 Unbiased purification of equally mixed states 



The case 7i~i = 1x2 = 0.5 and R\ = R2 = 0.7 is presented in Fig. [L2[ Fig. |4.2(a)| shows that 
all the metrics lead to a common operation; likewise, Fig. [4.2(b)| shows that both (Stis)! 
and (3hs)2 are maximized with another common operation. The coincidence between the 
minimizers of (CK 2 ) 1 and (3~C) 2 , and between the maximizers of (^hs)! and (Shs^; should not 



come as a surprise — in Sees. |4.2.2.3 and 4.3.1 we saw that these coincidences are inherent 
to any unbiased transformation. 



All the remaining coincidences in Fig. [4.2(a)| are somewhat unexpected, and should be 
interpreted as a peculiarity of the particular task of this section in the qubit case. Indeed, 
numerical simulations for qutrit states (under the same circumstances of mixedness of states 
and uniform priorities) do not show the degeneracy observed here. Next, we provide a half- 
technical-half-intuitive clarification of the origins of these coincidences in the qubit case. 

For p and o any qubit density matrices of Bloch vectors r and s, the eigenvalues of p 
can be explicitly calculated to be ±1/2 |r — s|, and hence D(p, a) = "K(p, cr)/V2 = 0(p,a 
1/2 1 r — s\, or equivalently, 



a 



I 1 

(D(pi, o-j)) = — = (K(pi, a i )) 1 = (0(pi, a i )) 1 = \ 2^^iVi ~ Si 
* i=i 



(4.49) 



for any sequences [pi]f =1 and [o"i]f =1 of qubit states with Bloch vectors given by [?"i]f =1 and 
[ s «]f=i- Moreover, from the eigenvalue formula of pi — <7j, it is also simple to see that 



(%(pi,(Ji)) 2 
(0(A^)>2 



ly 1 
2 z-n=i 



2 1 

717 T~ 



max 7T,; Ti 



(4.50) 
(4.51) 



Now, note that Eqs. ( |4.49p , ( f4.50| ) and ( f4.51| ) all become proportional to each other if 
we make 7Tj = 1/J and assume |t\ — Sj| = k for some constant k. Of course, with these 
extra constraints, the optimization of any of the quantities above would lead to a common 
operation. That is it for the technical part. 

Intuitively, the extra constraints found above can be incorporated into the problem of 
interest without loss of generality: First, 71^ = 1/1 is already there by hypothesis. Second, 



4.4 Controller sensitivity to the choice of {Si) 



75 



since all the source Bloch vectors have the same length and all the target Bloch vectors 
have the same length (in particular, equal to one), it would be very odd if the Bloch vectors 
rj arising from the optimization of any metric would not dispose perfectly symmetric with 
respect to their target vectors. Assuming that this oddness would never occur, we can 
include the "redundant constraint" |t*» — Sj| = k for every i. Hence — under this intuitive 
assumption — the control problem should really be insensitive to the choice of metric, as 
Fig. [4.2(a)| demonstrates it is. 

A comparison of Figs. [4.2(a)| and 4.2(b) , suggests that the maximization of fidelity-like 
quantities tends to provide an improved elongation of the lengths of the source vectors than 
the corresponding elongation arising from the minimization of the metrics. On the other 
hand, the minimization of the metrics give operations that better approximate the angle 
between the target vectors. 

Finally, it is interesting to compare how the restriction to the set of EBTP maps affect 
each case. The minimization of the metrics over the set of EBTP maps leads to Bloch 
vectors that are approximately 5.3% shorter and separated by an angle 9.2% smaller than 
the lengths and angles arising from the minimization of the same metrics over the set of 
CPTP maps. On the other hand, the same restriction for the fidelity-like quantities leads 
to vectors that are actually 1% longer than the corresponding vectors from the CPTP case, 
however, as it should be the case, this is compensated with a substantial angle drop of 31%. 




Figure 4.2: (Color online) Comparison between the Bloch vectors resulting from the numerical 
optimization of (T>), ('K 2 ) 1 , (W) 2 > (0)2 an d (90 1 2 f° r the unbiased transformation (i.e., tti = 7r 2 = 
1/2) of a pair of mixed source states (dashed, blue) with R\ = R2 = 0.7 into a pair of pure states 
(dashed, red). The dash-dotted (black) arrows designate the optimal Bloch vectors arising from 
the optimization over the set of CPTP maps, while the dotted (green) arrows refer to the optimal 
transformation over the set of EBTP maps. 
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4.4.1.2 Unbiased purification of states of different mixedness 

The case iri = 7r 2 = 0.5, R\ = 0.7 and _R 2 = 0.9 is presented in Fig. fOj Notably, by 
removing the symmetry of equally mixed source states, most of the degeneracies observed 
in the last section are removed. The only exceptions are (K 2 ) x and (IK) 2 [Fig. |4.3(b)|| , and 



(^hs)! an d (I^hs^ [Fig- |4.3(d)| |, where the degeneracies survive due to the unbiased choice 



of priorities, as discussed before. 

Once again, we find that the minimizers of the metric quantities perform better at ap- 
proximating the target angle, while the maximizers of the fidelity-like quantities are better 
at approximating target lengths. In fact, it is now possible to see a smoother transition of 
this behavior while following the sequence of plots (D) — > (K^)^ 2 ~^ (^2 — * (J'hs)i 2 i R 
Fig. |4.3| . Noticeably, the CPTP map minimizing (D) yields the shortest vectors, but more 
widely separated. This is followed by the optimal CPTP for (IK 2 ) 1 or (IK) 2 , which gives 
slightly longer vectors, but separated by a smaller angle. Following the same trend comes 
(0} 2 and finally (3 r Hs) 12 , which gives the longest vectors separated by the smallest angle. 

When the restriction to EBTP maps is made, the same pattern applies for the angles, 
which are seen to decrease along the way. However, the length of one of the vectors breaks the 
pattern by decreasing while we progress along (D) -> (:K (2) ) (1)i2 -> (0) 2 . Nevertheless, even 
in the EBTP case, the maximal length of both vectors is achieved with the maximization of 

(^ks) in- 
still regarding the restriction to EBTP maps, we note that the same rule observed in the 
last section still applies. For all metrics, the output vectors are shorter and less separated 
than the corresponding vectors in the CPTP case. Only for the fidelity like measures, we 
have an lengthening of the vectors and a more substantial decrease of angle. 



4.4.1.3 Biased purification of equally mixed states 

The case m = 0.3, 7r 2 = 0.7 and R\ = R2 — 0.7 is presented in Fig. PO. Here, all the 



degeneracies are removed: Biasing not only destroys the monotonicity between ( r K 2 ) 1 and 
(IK) 2 and the proportionality between (!?hs)i an< ^ (^hs)^ but in setting a higher hierarchy 
to the atomic transformation of % = 2 it also breaks the (intuitively expected) symmetry 
between the effective Bloch vectors and the targets noted in Fig. [4.2| . 

Nevertheless, the optimal CPTP transformations preserve some important features from 



the unbiased cases. From the case of equally mixed sources (Sec. |4.4.1.1| ), we note the 
commonality that the two states resulting from the optimization of each measure have the 
same length. This shows that it is only via an adjustment of the angle between the effective 
and target states that biasing is accounted for, as opposed to a possible enhanced lengthening 
of the vector of higher weight. 



From the case with different degrees of mixedness (Sec. 4.4.1.2 ), we note that the sequence 



of measures leading to vectors of increasing length and decreasing angle is still approximately 
the same: (K 2 ) 1 -> (D) -> (K) 2 -> (0) 2 -> (I?hs)i -> (3 = hs) 2 - The difference is that now 
it is the minimization of (K 2 ) x that yields the shortest and most angularly separated Bloch 
vectors. 

None of the above holds for the optimal EBTP transformations. In this case, biasing is 
accounted for not only by an adjustment of angle, but also by making longer the Bloch vector 
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Figure 4.3: (Color online) Comparison between the Bloch vectors resulting from the numerical 
optimization of (D), (!K 2 } 1 , (5C) 2 > (0)2 an d (9^1 2 f° r the unbiased transformation (i.e., 7ri = 712 = 
1/2) of a pair of mixed source states (dashed, blue) with R\ = 0.7 and R2 = 0.9 into a pair of pure 
states (dashed, red). The dash-dotted (black) arrows designate the optimal Bloch vectors arising 
from the optimization over the set of CPTP maps, while the dotted (green) arrows refer to the 
optimal transformation over the set of EBTP maps. 
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arising from the transformation of higher weight. Moreover, although (fris^ an d (^hs): are 
still easily identified as the measures that give the longest vectors separated by the smallest 
angles, it is not so clear how the metrics should be ordered because the angles do not always 
decrease as the lengths increase. As a general observation, we have the metrics (D), (Jt} 2 and 
(0) 2 leading to vectors which have approximately the same lengths, but the angle between 
them decreases in the sequence (0) 2 — > (!K) 2 — > (D), which is actually reversed with respect 
to the ordering found in Sec. f4.4.1.2| . 

It is also worth noting how the angle between the two vectors drop as we go from the 
optimal CPTP transformation to the optimal EBTP transformation with respect to a fixed 
measure. Consistently with the previous sections, the highest angle drops occur for the 
fidelity-like measures, and is of approximately 24%. Second in the rank is (D) with a drop 
of 18.3% — much higher than the observed in the unbiased cases. The measures ('K 2 ) 1 and 
(3~0 2 come with an approximately equal angle drop of 10%. Finally, we have (0) 2 , with only 
2.7%. As expected, the largest angle drops occur for those measures that, in the EBTP case, 
attempt to lengthen the Bloch vectors with respect to the vectors obtained in the CPTP 
case. This property — which was noticed just in the fidelity-like measures in the unbiased 
cases — is now also detected in (D) (and very slightly in (3~0 2 ). 



4.4.2 Quantitative analysis 

In the previous section we looked at some Bloch disks to visualize discrepancies between the 
optimization of different distance measures for a common purification problem. From the 
plots, we have seen that the extent to which these discrepancies occur varies according to 



which two measures we choose to compare. For example, a comparison between Figs. [1.3(a 



and [4.3(b)| reveals a much closer resemblance than a comparison between Figs. [4.3(a)| and 



4.3(d)| . This suggests that — for the specific control problem under consideration — the con- 
troller that solves the minimization of (T>) is not so different from the controller minimizing 
('K 2 ) 1 , at least not as much as the controller maximizing (3hs)i- 

In this section, we attempt to make this idea of "resemblance between optimal controllers" 
a little more formal and less qualitative, in such a way to enable analogous comparisons 
between the role of different distance measures for problems involving higher dimensional 
systems and/or a larger number of atomic transformations. Our ultimate goal is to sort our 
distance measures in a decreasing order of compatibility with respect to a chosen reference. 
For example, if (T>) is taken to be the reference, we would like to know how to order the 
remaining measures in such a way that the optimization of the first element in the list yields 
the controller which is, in some sense, the closest one to that produced by the minimization 
of (2)). In practice, such a list should provide the guidelines for choosing a computationally 
cheaper optimization problem to replace a more expensive one. 

We quantify the closeness between optimal controllers according to the following con- 
struction: Start by choosing a particular convertibility problem, i.e., for some chosen values 
of / and d select sequences of d-dimensional density matrices [pi]( = i and [7Jj]f =1 . For defi- 
niteness, let (D) be our reference measure, and (D)* its minimal value for the problem at 
hand. The most compatible measure (^) G {(CK 2 ) 1 , (3~0 2 , (0) 2 , (^hs)! , (3 r Hs) 2 } with respect 
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Figure 4.4: (Color online) Comparison between the Bloch vectors resulting from the numerical 
optimization of (D), (CK 2 ) 1; (!K) 2 , (0) 2 and (3~) 12 for a biased transformation with 7Ti = 0.3, 
7T2 = 0.7 of a pair of mixed source states (dashed, blue) with R\ = R2 = 0.7 into a pair of pure 
states (dashed, red). The dash-dotted (black) arrows designate the optimal Bloch vectors arising 
from the optimization over the set of CPTP maps, while the dotted (green) arrows refer to the 
optimal transformation over the set of EBTP maps. 



to the specific convertibility problem at hand, is defined to be the one whose optimal con- 
troller yields a value of {D) [denoted (2)(V))] such that (D(^)) — (T>)* is the smallest over 
all possible choices of (V) (the second, third, etc. positions being decided in the obvious 
way). Throughout, we shall refer to the difference (X(^)) — (X)* as the performance drop 
in units of (X) due to the optimal controller for (V), or, for brevity, A(X|y). From the 
above construction, it should be clear that the value of A(X|V) is not only dependent on 
the choices of reference measure (X) and replacement measure (V), but also on the specific 
choice of states involved in the transformation we want to implement. 

In the following subsection, we present the details of a numerical analysis (based on 
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unbiased transformations of random sequences with d = 2, 3, 4 and / = 2,3,4) that led to 
averaged values of A(X|^) over many transformations. These results suggest the "typical 
compatibility orderings" proposed in Table |4.2| . Because these orderings were identified 
from the consideration of only unbiased transformations, we avoided redundancies and did 
not include (%) 2 and (3 r H s> 2 in the Table [1| (cf. Sees. [4.2.2.3| and pHP . 

Interestingly, we found that these typical orders do not seem to depend on the dimension 
d of the quantum system, nor on the number I of atomic transformations involved. However, 
our results along this direction are still preliminary and further numerical support would be 
required before more reliable conclusions could be drawn. 



Table 4.2: Conjectured compatibility orderings between distance measures based on unbiased 
state transformations between randomly generated sequences of d = 2, 3, 4 and 7 = 2,3, 4. 



Reference 


Decreasing order 


measure 


of compatibility 


(V) 

(0>2 
(^HS)! 


(0) 2) <3m)i 
<0) 2) CD) 

Mi,(B)>(H 

(2)),(^ 2 ) 1 ,(0) 2 



Note that in the first line of Table [4.2| , the order of compatibility with respect to (D) 
reproduces the sequence of measures (found on Sec. |4.4.1.2| ) that yields Bloch vectors of 
increasing lengths and decreasing angles for the case I = d = 2. This is a nice property, since 
the similarity between the Bloch vectors noted in that section could be regarded as a measure 
of compatibility for qubit state transformations. It is then interesting (and reassuring for 
the establishment of a generalized notion of compatibility between distance measures) to 
find that even in more general transformations, classified by a more general compatibility 
measure, the same order is still observed. 

From the first and third lines of Table j~2| , we note that (0~C 2 ) 1 is the most compatible 
measure to the metrics (2)) and (0) 2 . The high compatibility between ( l H 2 ) 1 with (D) is 
particularly significant: Because ( < K 2 ) 1 is quicker to compute than (T>) (cf. Fig. [4.1| and 
Table |4.1| ), its minimization can be regarded as an efficient estimator of a minimizer for (T>). 
Similarly, from the second line of Table [OL we see that (0) 2 is the most compatible measure 
to { < K 2 ) 1 , therefore analogous conclusions apply. 

Finally (and somewhat sadly), we note that (^hs)! is the less compatible measure with 
any of the metrics. Once again, this was already seen in the less general context of the 
previous section. To mention some very rough estimates, we found that the performance 
drop A(X|3 r Hs) is usually of the order of 10% for any choice of metric X if we consider 
transformations involving targets of pure states. This drop can get as high as 50% in the 
more general case of transformations from mixed to mixed states, in which cases ^hs is not 



a well motivated distance measure, as explained in Sec. 4.3 



Construction of Table |4.2| . We only consider unbiased transformations of problems in- 
volving quantum systems of dimension d = 2,3,4 and a number of atomic transformations 
/ = 2,3,4. For each one of the nine pairs (/, d) that can be constructed, we computed the 
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performance drops A(X|y) for all possible combinations of X and V in 300 different transfor- 
mations. From these, 100 transformations were chosen to be between randomly generated 
sequences of pure states, 100 from randomly generated sequences of pure states to randomly 
generated sequences of mixed states and 100 between randomly generated sequences of mixed 
states. 

We then computed the average value and standard deviation of AX|^ for each fixed 
choice of X and V, and for each type of transformation. Table [O] shows the results for the 
reference measure X = T> and the transformations where the source and target sequences 
were made out of mixed and pure density matrices, respectively. In Fig. [4.5| the results 
of Table [O] are repeated and extended to account for other choices of reference measures 
(indicated in the horizontal axis), but still in the case of transformations from mixed to pure 
states. The vertical axis of each plot indicates the percental value of A(X, V), and each bar 
represents a choice of ^, as indicated in the code shown in the middle plot on the first line. 
From the observation of this plot (and others arising from other types of transformations), 
the compatibility orderings of Table (|4~2] ) were constructed. 



Table 4.3: A quantitative analysis of the compatibility of (D) with respect to the other 
measures considered. The numbers in the table correspond to the mean values and standard 
deviation of A(D|'y) averaged over 100 randomly generated unbiased transformations from 
mixed source sequences to pure target sequences. The choice of ^ is indicated in the firs line 
of each column. 
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Figure 4.5: Estimates of optimal controllers compatibility for transformations between se- 
quences of mixed and pure states. In each plot, the vertical axis gives the performance drop 
A(X|y) averaged over one hundred randomly generated transformations between sequences of den- 
sity matrices. (X) is the reference measure specified in the horizontal axis, and (y) is one of the 
possible replacement measures, as specified in the middle plot on the first line. Each one of the 
nine plots corresponds a fixed value of / and d, indicated in the external set of axis. The error bars 
give the standard deviation of the averages. 
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To a large extent, this chapter reproduces Ref. @. Some minor notational changes were 
made in order to make the chapter consistent with the notation adopted in the remainder of 
the thesis; in addition, a few references were included and updated. More significantly, new 
scientific results obtained after the publication of @ were included here as Sec. [5.3. 2| . This 
led to a few minor additions in other parts of the text, in order to integrate the new results 
with the pre-existing material. 

From a mathematical viewpoint, the main result of this chapter is an analytical solution 
for the problem of optimally converting between sequences of two qubit states (J = d = 2) in 
a particular setting: the target states 'p i are taken to be certain non-orthogonal pure states 
and the source states are taken to be pi = £ p (pj), where S p is a dephasing map. The notion 
of optimality is captured by the maximization of (S^hs)!; which is identical to (3 r ) 1 due to 
the purity of the targets. For ease of notation, we will denote it simply by (3 r ). The optimal 
solution over both feasible sets Q| et and £>| ct is obtained analytically. 

Physically, the problem is phrased in terms of stabilizing an uncertain preparation of 
a qubit against dephasing noise. Candidate strategies are initially proposed on a physical 
basis, and subsequently proved to be optimal with the SDP machinery introduced in Chapter 
2. The optimal operation over Q| et is motivated by the idea of using feedback control to 
optimize some quantum mechanical trade-off between information gain and disturbance. The 
optimal operation over £>| ct , in turn, arises from the classical control paradigm of maximizing 
the information gain. 

5.1 Introduction 

Any practical quantum technology, such as quantum key distribution or quantum computing, 
must function robustly in the presence of noise. Many modern "classical" technologies 
tolerate noise, faulty parts, etc., by relying on feedback control systems, which monitor 
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the system and use this information to control its state. Given the ubiquity and power 
of feedback control for classical systems, it is worthwhile investigating how such control 
concepts can be applied to quantum technologies as well. However, strategies for quantum 
control must take into account some fundamental features of quantum mechanics, namely, 
restrictions on information gain, and measurement back-action. 

Classically, it is possible in principle to acquire all the information about the state of 
a system with certainty by using sufficiently precise measurements. That is, the state of a 
single classical system can be precisely determined via measurement. For quantum systems, 
however, this is not always possible: if the system is prepared in one of several non-orthogonal 
states, no measurement can determine which preparation occurred with certainty. 

In addition, for quantum systems, monitoring comes at a price: any measurement that 
acquires information about a system must necessarily disturb it uncontrollably. This feature 
is often referred to as back-action — the fundamental noise induced on a system through any 
measurement, which maintains the uncertainty relations. This feature of quantum measure- 
ment is also distinct from the classical situation, wherein measurements that do not alter 
the state of the system can in principle be performed. 

These two fundamental features of quantum systems — that non-orthogonal states can- 
not be perfectly discriminated, and that any information gain via measurement necessarily 
implies disturbance to the system — require a reevaluation of conventional methods and 
techniques from control theory when developing the theory of quantum control. 

In this chapter, we investigate the use of measurement and feedback control of a single 
qubit, prepared in one of two non-orthogonal states and subsequently subjected to noise. 
Our main result is that, in order to optimize the performance of the control scheme (as 
quantified by the average fidelity of the corrected state compared to the initial state), one 
must use non-pro jective measurements with a strength that balances the trade-off between 
information gain and disturbance. 

Belavkin was the first to recognise the importance of feedback control for quantum sys- 
tems and describe a theoretical framework for analysing both discrete and continuous time 



models ||, |139|| . Despite this early start, it is only recently that the degree of control and 



isolation of quantum systems has progressed to the point that the experimental exploration 



of quantum control tasks has been possible |L2|, [13|, [14], [L5|, [L6[ [17j, and the field is now 
undergoing rapid development (see for example |140| ). 

The specific control problem we are interested in here is the stabilization against noise 
of states of a single two level system. Similar problems have been considered in continu- 
ous time feedback models, e.g., the stabilization of a single state of a driven and damped 
two-level atom [ 141 , |142|| and the maintenance of the coherence of a noisy qubit using track- 



ing control ||143|| . Several recent papers have investigated state preparation and feedback 



stabilization onto eigenstates of a continuously-measured observable in higher-dimensional 



systems '. 14, 145 



In contrast to these prior investigations, we investigate a feedback scheme to stabilize two 
non-orthogonal states of a two-level system. We work in a discrete-time setting, rather than 
continuous-time as considered in most prior work, which considerably simplifies the problem 
and most clearly illustrates the central concepts. In significant earlier work in a discrete 
time setting, Barnum and Knill proposed near-optimal strategies to correct ensembles of 
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orthogonal states after a general noise process [146]. While Gregoratti and Werner have 
investigated this kind of model of recovering the state of the system after interaction with 
the environment [ 147 , 14811 , their investigation considered the case where it is possible to 
make measurements on the environment. In our setting we imagine that the environment 
that causes the initial decoherence is not available subsequently for the feedback protocol. 
Very recently, Ticozzi and Viola [ 149 | have applied both dynamical decoupling and feedback 
methods to suppress unwanted dynamics of a single qubit in discrete time. 

Our main interest is to investigate the effects of the kind of trade-off between infor- 
mation and disturbance that is ubiquitous in quantum information in a concrete optimal 
control problem. Related information-disturbance trade-offs in quantum feedback control 
are discussed in ||. Finally, we note that implementing quantum operations on a single 
qubit through the use of measurement and feedback control as considered here has been 
investigated for eavesdropping strategies in quantum cryptography [|150|| and for engineering 
general open-system dynamics [|151 [. 

Note that there is a fundamental difference between the kind of quantum control problem 
we are considering here and the related task of quantum error correction. (For an introduc- 
tion to the latter, see ||21| .) The essence of quantum error correction is to encode abstract 
quantum information into a physical quantum system and to choose degrees of freedom that 
are unaffected by the relevant noise, or upon which errors can be deterministically corrected. 
However, it can be the case that one wishes to protect particular physical degrees of free- 
dom of quantum systems and one is not free to choose an arbitrary encoding. (One such 
example is reference frame distribution via the exchange of quantum systems ||152|| ). The 
quantum states required for these schemes cannot be encoded into quantum error correcting 
codes or noiseless subsystems [153|; protecting such systems from noise may therefore be an 



application of this kind of quantum control. 

The chapter is structured as follows. In section |5.2j , we define the control task in detail; in 
section |5.3| , we present and determine the performance of control strategies based on "clas- 
sical" concepts. Section [5]4| introduces our quantum strategy, investigating the use of weak 
quantum measurements, and analyses its performance against the strategies of section O . 
We also demonstrate that our quantum control scheme is optimal for the task at hand. In 
section we discuss the implications of our result and their relevance to other problems. 



5.2 A Simple Control Task 

The aim of this chapter is to explore the key issues we will confront when applying concepts 
from control theory to finite-dimensional quantum systems. In order to facilitate the analysis 
and to be able to concentrate on the key departures from classical control, we will choose a 
very simple quantum system and noise model. The emphasis is not towards a practical task, 
but as an illustrative example. 

Consider the following operational task: a qubit prepared in one of two non-orthogonal 
states lipi) or |?/> 2 ) (with overlap (V'iIV^) = cos# for < 9 < 7r/2) is transmitted along a 
noisy quantum channel. Without knowing which state was transmitted, we will attempt to 
"correct" the system, i.e., undo the effect of the noise, through the use of a control scheme 
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Figure 5.1: Schematic of a quantum control procedure. A qubit, subjected to dephasing noise, 
is subsequently measured and corrected based on the results of this measurement. The output state 
p is compared with the input state \ip) to characterise how well the scheme performs. 



based on measurement and feedback []; see Fig. |5~T| . 

The noise model that we will consider is dephasing noise. Let { |0) , 1 1) } be a basis for the 
qubit Hilbert space, and the Pauli operator Z is the unitary operator defined by Z\0) = |0), 
Z\l) = — 11). Dephasing noise is characterized as follows: with probability p a phase- flip 
Z is applied to the system, and with probability 1 — p the system is unaltered. The noise 
is thus described by a quantum operation pi] , i.e., a completely-positive trace-preserving 
(CPTP) map £ p , that acts on a single-qubit density matrix p as 

£ p (p)=p(ZpZ) + (l-p)p. (5.1) 

We will consider the noisy channel to be fully characterized, meaning that p is known and 
without loss of generality in the range < p < 0.5. 

We will choose the two initial states to be oriented in such a way that their distinguisha- 
bility, as measured by their trace distance, is maintained under the action of the noise. It is 
straightforward to show that this condition is satisfied by the states 

=cosf|+) + sin§|-), (5.2) 
fe) =cosf|+)-sinf|->, (5.3) 

where |±) = (|0) ± \l))/y/2. 

Consider the Bloch sphere defined by states |0) and |1) as the poles on the z-axis. The 
two states |^q) and \vjj 2 ) lie in the x—z plane and straddle the equator of the Bloch sphere by 



angles ±(9; see Fig. |57|. On this Bloch sphere, the dephasing noise acting on these states has 
the effect of decreasing the x-component of their Bloch vectors. The trace distance between 
these two states, given by the Euclidean distance between their Bloch vectors, is invariant 
under this dephasing noise. 

We now consider whether there exists a control procedure C (some "black box" ) that can 
correct the state of this system and counteract the noise, at least to some degree, independent 



Our use of the term "feedback" based on the measurement of a system refers to a subsequent operation 
performed on the same system (as opposed to a different, identically-prepared system). This use of the 
term is standard in the quantum control literature; however, the term "feedforward" is occasionally given 
the same meaning in the quantum computing literature (as the operation is applied forward in the quantum 
circuit). For the purpose of this thesis, we can consider both terms as synonymous. 
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Figure 5.2: (Color online) Bloch sphere representation of the initial states, and the states after 
the noise. The noise shortens the Bloch vectors along the x-axis. We have used the notation £{ip) 
shorthand for £ (ip\) . 



of which input state was prepared. To quantify the performance of any such procedure, we 
will use the average fidelity to compare the noiseless input states |^) with the corrected 
output states pf x . Assuming an equal probability for sending either state j^) or |-0 2 ) , the 
figure of merit is 

{^) = \m 1 ), P p + \m 2 ), P p 

= K?ilPi x |^i) + |(? 2 |P2 X fe), (5-4) 

where the fidelity between a pure state \ip) and a mixed state p is denned as 5F(|^),p) = 
(i/j\p\ip). The fidelity 3" ranges from to 1 and is a measure of how much two states overlap 
each other (a fidelity of means the states are orthogonal, whereas a fidelity of 1 means the 
states are identical). It has the following simple operational meaning when the input state 
is pure: the fidelity J(\ip),p) is the probability that the state p will yield outcome from 
the projective measurement {|^)(^|, \ip ± ){4> ± \}- 

Thus, the aim is to find a control operation, described by a CPTP map C independent 
of the choice of initial state, such that the corrected states 

pf x = C[S p (\A)(^\)}, (5.5) 

for i = 1,2 are close to the original states as quantified by the average fidelity. We consider 
control operations that consist of two steps: a measurement on the quantum system, followed 
by a feedback operation that is conditioned on the measurement result, as shown in Fig. 1571 . 



5.3 Classical Control 

In this section, we introduce three types of control schemes for this task, all of which are 
based on classical concepts, and we calculate the performance of these schemes based on the 
average fidelity. 
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5.3.1 Deterministic Discriminate and Reprepare 

For the control of classical systems, it is always advantageous to acquire as much information 
about the system as possible in order to implement the best feedback scheme. In line with 
this principle, a possible control strategy would be to perform a measurement on the system 
which attempts to discriminate between the input states, and then to reprepare the system 
in some state based on the measurement result. Three types of discriminate- and- reprepare 
schemes are investigated here and in the next section. 

An important constraint imposed on the two schemes of this section is that every possible 
measurement outcome points to some initial preparations and is followed by the reprepara- 
tion of some suitable state. Strategies of this sort are termed deterministic. On the other 
hand, it is possible to design schemes where some measurement outcomes do not suggest any 
initial preparation; for these, only the cases where discrimination step succeeds contribute 
to the performance. Such schemes are termed stochastic and will be investigated in the next 
section. 

We first characterize all possible deterministic discriminate-and-reprepare schemes; such 



schemes are associated with entanglement breaking trace preserving (EBTP) maps [f45| , |46 
as follows. Any discrimination step is described by a generalized measurement, (or positive 
operator- valued measure (POVM)) |21| yielding a classical probability distribution. The 
generalized measurement is described by the operators {P a } with P a > and Yl a = ^2- 
The resulting map on the quantum system is called a quantum- classical map QC [f43| , given 
by 

QC(p) = ^Tr(pP a )|e a )(e a |, (5.6) 

a 

where {|e a )} is an orthonormal basis. The reprepare step, in which the quantum system is 
re-prepared based on the classical measurement outcome, is described by a classical- quantum 
map CQ []43|1 , given by 

CQ(p) = ^Tr(p|e b )(e b |)g fe , (5.7) 
b 

where {Qb} are density matrices. 

The concatenation (CQo QC)(p) leads to a map of the form 

B{p) = J]Tr(pP 6 )g 6 . (5.8) 

b 

This map is an entanglement breaking channel. The name arises because the output system is 
unentangled with any other system, regardless of its input state. In fact it is straightforward 



to see from J|5|, f46| that all EBTP maps can be realised by some discriminate-and-reprepare 
scheme. Thus these EBTP maps formalize our notion of deterministic discriminate-and- 
reprepare strategies. 

The measurement for discriminating two (possibly mixed) preparations given by Hei- 



st rom |52 is optimal in terms of maximizing the average probability of a success. For 
our choice of states, Helstrom's measurement is a projective measurement onto the ba- 
sis {|0), |1)}, which successfully discriminates the states and \ip 2 ) with probability 
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-pHei = |(1 + sin6>). Note that because of the particular choice of dephasing noise, this 
success probability is independent of the noise strength p. 

We now present and analyse two possible discriminate-and-reprepare strategies, both of 
which are based on Helstrom's measurement. 

Deterministic Discriminate and Reprepare Scheme 1: 

With the outcome of Helstrom's measurement, one strategy is to reprepare the qubit in 
either state or \i[) 2 ) based on this measurement outcome. This scheme yields an average 
fidelity 

(3*>DRi) = p Hd x 1 + (1 - PhcO x |(^fe)| 2 

= 1-1 (sin 2 9 - sin 3 6) . (5.9) 

Such a replacement ignores the fact that the discrimination step can fail, with probability 
1 — Pneh i n which case a prepared state l^i) would be reprepared as |^ 2 ) ( or v i ce versa). 

Deterministic Discriminate and Reprepare Scheme 2: 

We can consider other strategies that reprepare different states so as to reduce the effect 
of the aforementioned error. In particular, we now demonstrate that the following pair of 
states maximizes the average fidelity: 

l*± DR > = \fl±W\o) + \f\r^\i) , (s.io) 

where 7 = V sin 4 9 + cos 2 9. Note that this replacement is also independent of p. Here, |^+) 
is prepared if the measurement outcome corresponds to l^i), and |^_) is prepared otherwise. 
In this strategy, the reprepared states are slightly biased towards the alternate state to that 
suggested by the measurement (smaller 9) — in a sense hedging our bet. As a proof of the 
superiority of this scheme over the former, the fidelity 

(^ DDR2 ) = Phci x (||(^|^ DR )| 2 + !K^i* DDR }| 2 ) + (5-n) 

(l-PHel)x(||(^|^ DR )| 2 +i|(^|^ DR )| 2 ) 

= - + -Vcos 2 9 + sin 4 9, (5.12) 

satisfies (a™ 112 ) > (?- DDR1 ) for all 9. Both (J™ 111 ) and (S^ 0112 ) are presented in Fig.Qa). 

This second discriminate-and-reprepare scheme is in fact the optimal deterministic dis- 
criminate and reprepare scheme, in that it achieves the highest average fidelity 

2 

i=i 



where the maximization is over all EBTP maps B acting on a single qubit. This optimization 
was performed (in a different setting) by Fuchs and Sasaki |47 |. In the Appendix |C.2| , we 
provide an alternate proof of optimality using techniques from convex optimization. 
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Figure 5.3: (Color online) The performance of the schemes, quantified by the average fidelity, 
as a function of the amount of noise p and the angle between the input states 8. a) Deterministic 
discriminate and reprepare scheme quantified by the average fidelity (9 rDDR2 ) of Eq. ( |5.12|) . The 
fidelity (3~ DDR1 ) of Eq. ( [5.9D is shown as a solid line at p = 0.5. Both average fidelities (9 rDDR2 ) and 
(9~ DDR1 ) are independent of p. b)"Do nothing" scheme, quantified by the average fidelity (3 rDN ) of 
Eq. (|5.22|) . For this scheme, the average fidelity drops to (3~ DN ) = 1/2 for p = 0.5 and 9 = 0. c) 
Stochastic classical and deterministic quantum control schemes, quantified by the average fidelities 
(grSDR^ = ^QCopt^ Q f Eqs . (|5.21[) and (|]||. The range of fidelities plotted has been made identical 



(J QCopt ) of Eqs. (l5~2ll) and (p| 
in all the figures to aid comparison. 



5.3.2 Stochastic Discriminate and Reprepare 

In this section we propose a particular discriminate-and-reprepare scheme in which the dis- 
crimination step can produce one of three different outcomes. In two of them, we get a 
suggestion of what the initial preparation was and suitably reprepare the system. In con- 
trast, no suggestion is conveyed when the third outcome occurs, in which case we simply 
declare our ignorance and do not reprepare any state. The performance of the scheme is 
computed considering only the random occurrences of suggestive outcomes, and for this 
reason, the scheme is said to be stochastic or post-selected. 

The scheme studied here can still be modeled by an entanglement breaking map, how- 
ever not a trace preserving one. The trace preserving condition is relaxed to account for 
the fact that sometimes no output state is produced, which, on average, leads to a map 
that outputs "density matrices of trace less than one". In fact, the set of entanglement 
breaking trace decreasing (EBTD) maps characterize all possible stochastic discriminate- 
and-reprepare schemes. The particular scheme proposed in the following is relevant because 
it seems to be optimal over the set of EBTD maps. In particular, it arises from the POVM 
that maximizes the success rate of discrimination for a certain fixed fraction of inconclusive 
results | |154j , |155| |, followed by a replacement resulting from an optimization procedure via 
Langrange multipliers. 

Motivated by the results of Refs. 



154| , |155 |, we propose the following POVM for the 



implementation of the discrimination step, 



n n 



IT 



1 



f J 2 + x) 



1 



-r x X± 



(5.14) 
(5.15) 



where r 7 



2p) cos 9 is the x component of the Bloch vector describing the system after 
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the noise. 

The outcomes '+' and '— ' are interpreted to be suggestive of the initial preparation 
and ^2)) respectively^. As in the second deterministic discriminate-and-reprepare scheme of 
the previous section, we acknowledge the possibility of a misleading suggestion by repreparing 
states which are not precisely or k/> 2 ), but slightly biased to the alternate state to that 
suggested by the measurement. In this case, the replacements are, up to a normalization 



|*f R ) = ( I f I |0> + |1> , (5.16) 



factor, 



tC + Vi + C 

where ( is implicitly defined as a function of 9 and p according to £ = tan 9 sin 9/ \J 1 — r | . 
It follows from this that now the repreparation depends not only on the initial states, but 
also on the details of the dephasing map; this fact being in contrast with the replacements 
used in the classical deterministic schemes introduced in the previous section. 

An outcome '0' signals an inconclusive result, in which case no repreparation step takes 
place. As explained before, this event is not taken into account for the characterization of 
the performance of the scheme. 

Let us now show how the stochasticity is included in the computation of the average 
fidelity. We first compute the probabilities Pq, Psc an d Psi of the following events: an 
inconclusive outcome, a suggestive outcome that correctly indicates the initial preparation 
and a suggestive outcome that incorrectly indicates the initial preparation, 

Po = ^Tr[n ^(|^ 1 ><^ 1 |)]+iTr[no^(|^ 2 ><^ 2 |)] , (5.17) 

P sc = ^Tt[U + £(\^ 1 )(^ 1 \)]+^Tt[U^(\^ 2 )^ 2 \)] , (5.18) 

P S i = l- Pi- P S r. (5.19) 
Then, the average fidelity is given by 

1 



(5F SDR ) 



Po 



Psc Qi^ii*f R >r+^ a i*? R >r) + 



(^l(^l* SDR )l 2 + ^K^I^ DR )l 2 



(5.20) 



where the division by 1 — P guarantees that only the suggestive outcomes are accounted. 
After some cumbersome manipulation, we obtain 



Comparing the above with the performance from the optimal deterministic discriminate 

tr* 2 



and replace from Eq. ( fU2D , we note that (3 rSDR ) > (3 rDDR2 ), since the denominator 1 — 



2 Recall from Ref. |156| that in our case of two mixed states of non-orthogonal support it is impossible to 
design a POVM for unambiguous state discrimination. That is why the outcomes '±' do not determine the 
initial preparation. 
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is obviously bounded between zero and one. Of course, it is the possibility of disregarding 
certain measurement outcomes that allows for this improvement. 

In Sec. |5]4] we will show that the fidelity ( |5.21|) can be obtained in a deterministic frame- 
work if we switch from the classical concept of discriminate-and-reprepare to a genuinely 
quantum approach to control quantum systems. Quite remarkably, we will see that Eq. (|5.21|) 
gives precisely the performance of the optimal deterministic quantum control scheme. 



5.3.3 Do Nothing 

Another control strategy would be to do nothing to correct the states. Although trivial, this 
strategy is of interest for comparison with other schemes. (There exist schemes that perform 
worse than this strategy, because of the feature of quantum systems that every measurement 
that acquires information will uncontrollably disturb the system.) This scheme does not lie 
within the set of discriminate-and-reprepare schemes described above (it is not described by 
an entanglement breaking map) but we will nonetheless refer to it as "classical." 
The average fidelity of this scheme is given by 

(3" DN ) = l - P cos 2 9. (5.22) 

This performance is plotted in Fig. |5.3| (b). Clearly, this scheme performs best for small 
amounts of noise (p ~ 0) and for input states with Bloch vectors that are near the z- 
axis (which is invariant under the dephasing noise). In some non-trivial regions of the 
(p, 8) parameter space, in particular in the range of low noise, this "do nothing" scheme 
outperforms the optimal deterministic discriminate-and-reprepare scheme. 



5.4 Deterministic Quantum Control 

In the previous section, we presented control schemes based on classical concepts. However, 
using techniques that may lead to optimal control schemes for a classical system may not 
necessarily lead to optimal schemes for a quantum system. As we will now demonstrate, 
the above deterministic classical control strategies can be outperformed by using a strategy 
based on quantum concepts, and the performance of the classical strategy can be obtained 
in a deterministic framework. 

We note that the classical schemes presented in the previous section lie at the ex- 
treme ends of a spectrum: the "discriminate-and-reprepare" strategy achieved maximum 
information gain and induced a maximum disturbance, whereas the "do-nothing" strategy 
achieved zero disturbance but produced zero information gain. As demonstrated by Fuchs 
and Peres [0], there exist an entire range of generalized measurements that trade off infor- 
mation gain and disturbance. A possible avenue for improvement in our control schemes 
is to tailor the measurement in such a way as to find a compromise, if one exists, between 
acquiring information about the noise but not disturbing the system too much as a result of 
the measurement. 

In the following, we exploit the non-uniqueness of the Kraus decomposition of a CPTP 
map (cf. Sec. [2.2. 3. 4|) to re-express the noise process E v in a way that suggests a strategy for 



constructing such an improved feedback protocol. 
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5.4.1 Reexpressing the noise 

To develop an intuitive picture, we will make use of a preferred ensemble for the quantum 
operation £ p describing the noise. That is, we use a decomposition of the operation into dif- 
ferent Kraus (error) operators than that given in Eq. (|5.1| ). The resulting quantum operation 
S p describing the noise, however, is equivalent. 

Consider the following quantum operation on a qubit, viewed on the Bloch sphere: with 
probability 1/2, the Bloch vector of the qubit is rotated by an angle +a about the z-axis, 
and with probability 1/2 it is rotated by —a about the z-axis. Rotations about the z-axis 
are described by the operator 

Z a = e ~ iaZ/2 = cos(a/2)l 2 - i sin(a/2)Z , (5.23) 

and the quantum operation is then 

£ a {p) = \Z a pZ ] Q + \Z„ a pZ ] _ a 

= sm 2 (a/2)(ZpZ) + cos 2 (a/2)p . (5.24) 

Thus, this quantum operation is equivalent to the dephasing noise S p , with p = sin 2 (a/2). 

Viewing the noise operation £ p with this preferred ensemble, it is possible to describe 
the noise as rotating the Bloch vector of the state by ±a with equal probability. A possible 
control strategy, then, would be to attempt to acquire information about the direction of 
rotation (±ct) via an appropriate measurement, and then to correct the system based on 
this estimate. Loosely, we desire a measurement that determines whether the noise rotated 
the state one way (+a) or another (—a). Then, based on the measurement result, we apply 
feedback: a unitary operation (rotation) that takes the state of the system back to the 
desired axis. 

A projective measurement, wherein the state of the system collapses to an eigenstate of 
the measurement, does not meet these requirements because such a measurement destroys 
the distinguishability of the two possible states. Instead, we consider the use of a weak 
measurement, with a measurement strength chosen to balance the competing goals of ac- 
quiring information and leaving the system undisturbed. We now show that such a strategy 
is possible, and that there is a non-trivial optimal measurement strength for this task. 



5.4.2 Weak non-destructive measurements 

For our quantum control scheme, we will make use of a type of measurement that satisfies 
two key requirements: (1) the strength of the measurement should be controllable, i.e., we 
should be able to vary the trade-off between information gain and disturbance (back-action); 
and (2) the measurement should be non-destructive, which leaving the measured system in 
an appropriate quantum state given by the desired collapse map. Such weak non-destructive 
measurements have recently been developed and demonstrated in single-photon quantum 
optical systems ||157| , |158| . 



Using the preferred ensemble describing the noise, Eq. ( |5.24| ), we expect intuitively that 
this weak measurement should be along the y-axis of the Bloch sphere in order to provide in- 
formation about which direction (±a) the system was rotated, without acquiring information 
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about which initial state the system was prepared in. One suitable family of POVMs con- 
sists of two operators given by E m = M^Mm, for m — 0, 1, where M m are the measurement 
operators [^] 

M = cos(x/2)|+*>(+2| + sin( X /2)H)H| , (5.25) 
M x = sin(x/2)|-N)(+z| + cos(x/2)|-z)<-z| . (5.26) 

The strength of the measurement depends on the choice of the parameter x- The eigenstates 
of Y are \±i) = (|0) ± z|l))/\/2. The probabilities of obtaining the measurement results 
m = 0, 1 for a qubit in the state p in are given by 

p m = Tr (E rn p in ) , (5.27) 

and the resulting state of the qubit immediately after the measurement is 

Pm 

Consider the following two limits. If % — 7r /2 the two measurement operators are the same 
and are proportional to the identity. As a result the outcome probabilities are independent of 
the state and the state of the signal is unaltered by the measurement. If x — 0, a projective 
measurement on the signal is induced: the signal state is projected onto the state 
when the measurement result is (1). For < x < 7r /2, the resulting measurement on the 
signal is non-pro jective but non-trivial. 

It is illustrative to view the effect of this measurement on the noisy input states on the 
Bloch sphere. In Fig. |5.4| (a) we can see that the effect of the noise is to shorten the length of 
the Bloch vector of the qubit state (making it less pure) while increasing the angle between 
the Bloch vector and the x-y plane from 9 to 9', where 9' > 9 . When the measurement is 
made, three things happen, as can be seen in Fig. |5.4| (b): 1) the Bloch vector is lengthened 
(the state becomes more pure); 2) the angle 9' decreases to some lesser angle 9"; and 3) 
the state is rotated about the z-axis one way or the other depending on the result of the 
measurement. The first two effects work towards our advantage (purifying the state while 
decreasing 9'); the third effect we attempt to correct using feedback. 

We will now describe how to implement this measurement using a projective measurement 
on an ancillary meter qubit and an entangling gate between the original signal qubit and 
the meter. The strength of the measurement can be controlled by varying the level of 
entanglement between the two qubits, which can be implemented by initiating the meter in 
the state |0) and subsequently applying a Y x rotation [as shown in figure |5.5|(a)l, where 

Y x = = - . (5.29) 

x V sm U/ 2 ) cos(x/2) J 

The parameter x ranges from to tt/2 and characterizes the strength of the measurement, 
with equivalent to a projective measurement and 7r/2 equivalent to no measurement. 

The entangling gate consists of a Xi rotation on the signal state, followed by a cnot 
gate with the signal state as the control and the meter state as the target, followed by a 
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Figure 5.4: (Color online) Bloch sphere representation of the effect of a weak measurement 
on the system. The transformations shown here correspond to having obtained the measurement 
result "0" (for the result "1", the behaviour would be a reflection in the x-z plane.) a) The 
two initial states are mapped to pi^ by the noise; b) a weak measurement is performed 

with < x < § j c) a strong projective measurement (x = 0) is performed projecting either 
state into |— i). While no measurement will not yield any information about the system, a strong 
measurement will maximally disturb the system. A weak measurement will gain some information 
while also limiting the disturbance on the system. 



on the signal state, where 

v V — zsm(0/2) cos(0/2) J 

and where the Pauli matrix X is given by X|0) = |1) and X\l) = |0). The rotations X±* 
are used to ensure that the resulting weak measurement on the signal qubit is performed in 
the {|+«), \ basis. The entangling gate then correlates (to a degree which depends on 
x) the \—i)} basis of the signal qubit to the {|0), |1)} basis of the meter qubit. 

Finally the meter qubit is measured in the basis {|0), |1)}, yielding a result or 1. This 
measurement on the meter induces a measurement on the signal that is precisely equal to 
the generalized measurement described by the measurement operators M m of Eq. ( |5.25| ). 



5.4.3 Feedback control 

Once a weak measurement has been performed, a correction based on the measurement 
result is performed on the quantum system: the feedback control. We choose the correction 
to be a unitary rotation about the z-axis, Z± v where 

Zv = ^ m = ( e 'T e 4JU) , (5.31) 
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Figure 5.5: (a) Circuit diag ram of the weak measurement scheme. The input signal state p m 
is entangled to the meter state using the cnot gate. The rotations ensure that the weak 

measurement of the signal state is made in the desired basis {]+£), |— i}}- The strength of the 
measurement is set using the rotation Y x . The meter state is measured in the computational basis, 
resulting in a classical signal (0 or 1) to be fed forward to the correction stage of the control scheme, 
(b) Circuit diagram of the control scheme. A weak measurement is made on the input state and, 
based on the measurement results, the signal state will be rotated by Z„ (Z^„) conditional on the 
result of the weak measurement being (1). 



with the aim to bring the Bloch vector of the qubit back onto the xz-plane. The angle of 
rotation is chosen to be ±rj, depending on the measurement result (+rj corresponding to the 
measurement result 0, and — r\ to the measurement result 1). It is possible to choose rj so 
that the system state is returned to the xz-plane for all values of p, 9 and x an d for both 
measurement outcomes by choosing 

tan n — -. r , (5.32) 

(1 — 2p) cos # tan x 

with 7] in the range < r] < tt/2. This angle t] can be calculated because the dephasing 
noise has been previously characterised (i.e., p is known). 

The resulting weak measurement followed by feedback is thus described by a quantum 
operation (a CPTP map) Cqc acting on a single qubit, given by 

C Q c(p) = (Z +v M )p(Z +v M o y + (Z_ r? M 1 )p(Z_ J? M 1 )t , (5.33) 

where the measurement operators M m are given by Eqs. ( |5.25[ ) and ( |5.26| ). 

In summary, the quantum control scheme operates by performing a weak measurement 



of the system and then correcting it based on the results of the measurement, as in Fig. |5.5| b. 
The weak measurement is made by entangling an ancillary meter state with the signal state 
using an entangling unitary operation, then performing a projective measurement of the 
meter state. The level of entanglement depends on the input state of the meter, which is 
controlled by a Y x rotation; this level of entanglement in turn determines the strength of 
the measurement. After measurement of the meter, the signal state is altered due to the 
measurement back-action. To correct for this back-action, a rotation about the z-axis is 
applied to the state, returning it back to the xz-plane. To characterise how well the scheme 
works, we now investigate the average fidelity 
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Figure 5.6: (Color online) (a) Fidelity of the quantum correction procedure with measurement 
strength (1 — 2x/7r) for a representative noise value {p = 0.145) and angle (9 = 0.715). The 
measurement strength ranges from a value of (corresponding to no measurement), through to a 
value of 1 (corresponding to a projective measurement). There exists an optimum measurement 
strength at which we balance the amount of information gained with the amount of back-action 
noise introduced. Also plotted for comparison are (b) the optimal "discriminate-and-reprepare" 
scheme and (c) the "do nothing" scheme for the same parameter values. 



5.4.4 Performance 

The performance of this quantum control scheme, quantified by the average fidelity ( |5.4j) , is 



<? QC > = \ 



1 + sin 8 sinx + cos 8\1 — (1 — r%) sin x 



(5.34) 



where r x — (1 — 2p) cos 6. 

We can see that (3 rCiC ) is a function of the amount of noise p, the angle between the 
initial states 8, and the measurement strength x- The dependence of this fidelity on the 
measurement strength, for fixed p and 8, is illustrated in Fig. |5.6| . For each value of p and 8, 
there is an optimum measurement strength Xopt which maximizes the average fidelity ( |5.34p . 



This optimum measurement strength is found to be non-trivial except for the limiting cases 
of p = or 8 = 0, 7r/2, and is given by 



sin 4 8 

Xopt (p, 8) = sin x - — . 4 , (5.35) 

'1 — r%) cos^ 8 + (1 — r l x ) sin 8 

as a function of the amount of noise p and the angle between the initial states 8. 

Substituting Xopt for x m Eq. ( |5.34j ), we get the following expression for the optimum 
fidelity: 

(jQC opt > = i + i / 2 e + s {5 36) 



Fig. |5T^(c) plots the quantum control fidelity as a function of the input state (character- 
ized by the angle 8) and the amount of noise (characterized by p). 
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Figure 5.7: A contour plot of the difference, as a function of the amount of noise p and the 
angle between the initial states 6, between the average fidelities of the quantum control scheme and 
the best classical scheme. The quantum control scheme performs significantly better for moderate 
values of p and 6 (0.05 < p < 0.3 and 0.3 < < 1). The maximum value F^a = 0.026 occurs at 
p = 0.115 and 9 = 0.715. 



We note that (g^ 00 ? 1 ^ = l for three limiting cases. If p — 0, there is no noise and 
so the state is not perturbed, resulting in unit fidelity for all values of 9 given by simply 
"doing nothing" (zero measurement strength and no feedback). When 9 = it/2, the states 
are orthogonal and point along the z axis. The noise does not affect these states, again 
resulting in unit fidelity for all values of p with a "do nothing" scheme. When 9 = the 
two states are equal and point along the x-axis. The control scheme reprepares this state 
after the noise by making a projective measurement x = to obtain either |+z) or and 
rotating back to the xz-plane (rj = vr/2). This results in a fidelity of 1 for all values of p. 



5.4.5 Comparison with Classical Schemes 



We now compare the quantum control scheme with classical schemes presented in Sec. |5T3 . 
Specifically, we first compare the quantum scheme with the best of the deterministic classical 
schemes at every point in the parameter space (p,9), i.e., we observe the difference in the 
average fidelities 

Fdif = ( J QCopt ) - max((? DDR2 > , (? DN » , (5.37) 

where (3 rDDR2 ) and (3 rDN ) are given by Eqs. ( |5.12|) and ( |5.22|) , respectively. Fig. |5.7| reveals 
that Fdif is always positive, and thus the quantum control scheme always outperforms the 
best of the classical strategies^. 

Comparing the quantum scheme with the stochastic classical scheme of Sec. [5.3.2|, we note 



that they both yield precisely the same fidelity [cf. Eq. (|5.21|) 1. In this case, the superiority of 
the quantum control scheme resides in producing this performance in a deterministic fashion. 



Quite recently, a rigorous demonstration of -Fdif > was given in Ref. [159 
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5.4.6 Optimality 

We now prove that our quantum control scheme is optimal, in that it yields the maximum 
average fidelity of all possible quantum operations (CPTP maps). Our proof makes use of 
techniques from convex optimization (specifically, those of ||19|| ) but is presented without 
requiring any background in this subject. In the Appendix |C.1| , we provide a more detailed 
construction of the proof. 

Consider the following optimization problem: determine the maximum average fidelity 

2 

(3***) = max(2 c ) =mwi^® i \C[S^ i )mWi), (5-38) 

i=l 

where the maximization is now over all CPTP maps C acting on a single qubit. 

Recall from Sec. |2.2.3.2| that any CPTP map C acting on operators on a Hilbert space H 
is in one-to-one correspondence with a (unnormalized) density operator £ on H ® H via 

C{q) = Tn [(q t ® 1 2 )£] , (5.39) 

and is subject to the constraint Tr 2 £ = 1 2 , where the subindexes 1 and 2 used next to 
the trace operation denotes the partial trace over the first and second subsystems, respec- 
tively [21, ^9], |160|| . With this isomorphism, the average fidelity (3 rC ) for the control scheme 



C is given by (3 rC ) = Tr (RC), where 

2 

R = 3 E fe(i^x^i)] T ® fa><&i • ( 5 - 4 °) 



i=l 



Thus, the optimization problem ( |5.38| ) can be rewritten as 

maximize Tr (R€) 

subject to £ > (5.41) 
Tr 2 £ = 1 2 . 

We now wish to prove that the maximum value of Tr (RC) subject to these constraints is 
given by (^ c °P t ) of Eq. ( ggg ). 

We note that, for any single-qubit operator M satisfying M <8> I2 — R > 0, we obtain the 
inequality 

Tr M - Tr (R£) = Tr [(M <g> 1 2 )£] - Tr (R€) 
= Tr [(M® 1 2 -R)C] 

>0, (5.42) 

where the first line follows from the constraint Tr 2 £ = 1 2 , and the inequality follows from 
the fact that (M ® 1 2 — R) > and £ > 0, and thus the trace of their product is non- 
negative. This inequality demonstrates that the value Tr M for any matrix M that satisfies 
the constraint (M®1 2 — R) > provides an upper bound on the solution of our optimization 
problem ( |5.41| ). 
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Consider the matrix M = &o(H-2 + r xX), where 

sin 4 ft 

(5.43) 

and r x = (1— 2p) cos# as before. It is straightforward to verify that the matrix bo1l4+r x boX ® 
2-2 — R > 0, and hence the value Tr M = 2bo provides an upper bound on the average fidelity 
of any control scheme. Because 2&o precisely equals the fidelity of our proposed quantum 
control scheme, given by Eq. ( |5.36| ), this scheme necessarily gives an optimal solution to the 
original problem ( |5.41| ). We refer the reader to the Appendix |C.1| for a more constructive 
proof of this result. 



5.5 Discussion and Conclusions 

We have shown how two key characteristics of quantum physics — that non-orthogonal 
states cannot be perfectly discriminated, and that any information gain via measurement 
necessarily implies disturbance to the system — imply that classical strategies for control 
must be modified or abandoned when dealing with quantum systems. By making use of 
more general measurements available in quantum mechanics, we have been able to design 
a deterministic quantum control strategy that outperform deterministic schemes based on 
classical concepts. Quite interestingly, our quantum scheme was also shown to achieve the 
same performance of an (arguably) optimal stochastic classical scheme, demonstrating that 
for the present problem the gap between classical and quantum deterministic control is just 
as large as that between deterministic and stochastic control. Whether this is a general 
feature of more general control problems, is an interesting avenue of future research. 

In constructing our quantum control scheme for the particular task presented here, we 
made use of several intuitive guides. First, we used a preferred (and non-standard) ensemble 
of the dephasing noise operator [cf. Eq. ( |5.24| )1, which allowed us to view the noise as 



"kicking" the state of the qubit in one direction or the other on the Bloch sphere. We then 
made use of a weak measurement in a basis that, loosely, attempted to acquire information 
about the direction of this kick without acquiring information about the choice of preparation 
of the system. It is remarkable (and perhaps simply lucky) that these intuitive guides lead 
to a quantum control scheme that was optimal for the task. It is interesting to consider 
whether such intuition can be applied to quantum control schemes in general, and if this 
intuition can be formalized into rules for developing optimal control schemes. 

While our scheme is indeed optimal for the task presented, it is not guaranteed to be 
unique; in fact, there are other decompositions of the same CPTP map into different mea- 
surements and feedback procedures [|161]J . In general, it is possible that an entire class of 
CPTP maps may yield the optimal performance. Also, the intuitive guides discussed above 
for our quantum control scheme — such as that the measurement essentially gains infor- 
mation only about the noise and not the choice of initial state — may not apply to other 
optimal schemes. 

In connection to this, we note that a similar feedback control scheme was investigated 



by Niu and Griffiths [|150|| for optimal eavesdropping in a B92 quantum cryptography pro- 



tocol [ TE2 \, see also In their scheme, the aim of the weak measurement was to maximize 
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the information gain about which of two non-orthogonal states was transmitted for a given 
amount of disturbance; in contrast, our weak measurement was designed to acquire no infor- 
mation about the choice of non-orthogonal states. Despite these opposing aims, the obvious 
similarity between these our scheme and that of Niu and Griffith warrants further investi- 
gation, particularly since we note that optimal feedback protocols exist based on different 
choices of measurement. 

It is also of interest to determine if non-trivial control schemes exist for other types of 
noise processes, or if these results can be generalized to larger numbers of initial states and 
to higher- dimensional systems. 

Finally, we note that the key element to our quantum control scheme — weak QND 
measurements on a qubit, and feedback onto a qubit based on measurement results — have 
both been demonstrated in recent single-photon quantum optics experiments. Specifically, 
Pryde et al. [ 157|| have demonstrated weak QND measurements of a single photonic qubit, 
and have explicitly varied the measurement strength over the full parameter range. Also, 
Pittman et al. [ |163|1 have demonstrated feedback on the polarization of a single photon 
based on the measurement of the polarization of another photon entangled with the first; this 
feedback was used for the purposes of quantum error correction, and is essentially identical 
to the feedback required for our quantum control scheme. Because these core essential 
elements have already been demonstrated experimentally, we expect that a demonstration 
of our quantum control scheme is possible in the near future. 
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Optimal tracking for pairs of qubit states 



6.1 Introduction 

A common goal of many problems in quantum information science is the search for quantum 
operations that simultaneously transform a set of given input quantum states into another 
pre-specified set. Well known examples are tasks such as quantum cloning, state discrimi- 
nation and quantum error correction. 

In general, though, quantum mechanics forbids arbitrary quantum state dynamics. As a 
result, one is left with several examples of "impossible quantum machines" ||164|| . Not only is 



quantum cloning unachievable |165| , |16(j| , |167| | , but also quantum state discrimination strate- 
gies are typically subject to non-zero misidentification probabilities |82] and/or inconclusive 



outcomes [|168| , |169j |170|1 and there are no quantum error correction protocols capable of 
fully reverting the action of an arbitrary noise model [ 147|| . 



Nevertheless, it is still possible to approximate ideal (but unphysical) transformations 
with optimal (but physical) ones. This provides quantum limits to the performance of tasks 
such as state discrimination, cloning and so on. In this chapter, we study the general problem 
of transforming the state of a single qubit into a given target state, when the system can be 
prepared in two different ways, and the target state depends on the choice of preparation. 
We call this task quantum tracking, a term borrowed from classical control theory. Our main 
result is an analytical description of an optimal quantum tracking strategy. 

More specifically, the quantum tracking problem studied here can be understood as 
follows. Consider that Alice prepares either a qubit state p\ with probability ~K\ or p 2 with 
probability 7r 2 . Bob is allowed to interact with the system in any physically allowed way, 
aiming to enforce the tracking rule 

if Alice prepared , then ouput Pj (6.1) 

for 2 = 1,2 and some given qubit density matrices 'p i . 
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Figure 6.1: (Color online) The transformation of the mixed states p± and p 2 (in blue) into the 
pure states pi and p 2 ( m re d) is not a physical one. However, there is a physical transformation 
C capable of transforming the input states into states C(pi) (in black) which closely approximates 
the targets. Note, in the detail, that C{p\) and C(p 2 ) are still slightly mixed. 



At his disposal, Bob has all the information about the possible preparations p« and their 
respective prior probabilities m, but not the actual preparation (the value of the index i). 

Because quantum states are generally not perfectly distinguishable, a strategy that at- 
tempts to identify Alice's preparation and then reprepare the target according to rule (|6.1|) is 
not always guaranteed to succeed. In fact, this limited distinguishability is an unsurpassable 
obstacle in the implementation of ( |6.1| ). 

Throughout, an optimal solution is defined as follows. Amongst all the physical transfor- 
mations acting on the input states pi, an optimal one is any map that outputs density matri- 
ces p\ such that the averaged Hilbert-Schmidt inner product between p\ and is maximal. 
When /7j are pure states, such a figure-of-merit coincides with the averaged Uhlmann-Jozsa 
fidelity |76|, |77| , and this notion of optimality gains an appealing operational interpretation 
[ p?| . Suppose that Alice [aware of her preparation and of rule (|6.1|) 1, decides to check whether 
Bob prepared the density matrix he was supposed to, and for that purpose she performs a 
verification measurement on the density matrix produced by him. If Bob chooses an optimal 
transformation according to the above prescription, then the probability he will pass Alice's 
test is as large as allowed by quantum mechanics. 

The tracking problem resembles the transformability problem for pairs of qubit states 
studied by Alberti and Uhlmann in the 80's [[171|| (see also Appendix p. 1|) . In ||171|| , a criterion 
based on the distinguishability between the source density matrices and the distinguishability 
between the target density matrices was developed in order to decide on the existence of a 
completely positive and trace preserving (CPTP) map simultaneously transforming each 
source into each target. 

Although Alberti and Uhlmann's criterion classifies the set of states pi, p 2 , p 1 and p 2 
for which rule ( |6.1|) can be satisfied, it does not provide a construction of the CPTP map 
implementing that transformation, nor touch the problem of how to find a feasible approxi- 
mation when the criterion is not satisfied. For many purposes, the requirement of perfectly 
converting sources into targets is unnecessarily strong, as some strictly impossible physical 
transformations can be very well approximated by physical ones, as illustrated in Fig |6.1| . 
In fact, any experimental realization of a map is just an approximation of it. 

Another problem closely related to our aims was investigated in Ref. B. Specifically, 
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we considered the problem of determining the optimal quantum operation to stabilize the 
state of a single qubit, randomly prepared in one of two pure states, against the effect of 
dephasing noise. The results of are here extended in several ways. The input states are 
allowed to be mixed and prepared with arbitrary prior probability distribution; the noise 
model is arbitrary and, most importantly, the stabilization task is replaced with tracking. 

Finally, there is an intrinsic connection between the quantum tracking problem and the 
"optimization approach" [^, |8£| |9(| 91, |92| to quantum error correction []. In these 



references, the encoding and recovery operations are regarded as optimization variables whose 
optimal values maximize a given figure-of-merit (typically a function bounded between and 
1, equal to 1 if and only if the noise dynamics is reversible). Efficient numerical methods are 
then proposed to solve the optimization problem. The key differences between our work and 
these references is that we do not consider encoding of the initial state and focus on reverting 
the noise dynamics experienced by only a pair of states. By doing so, the optimization of 
the recovery operation can be handled analytically for a conveniently chosen figure-of-merit. 

The chapter is structured as follows. Section ^]2| introduces the formal statement of the 
problem and our working strategy, which is proved to be optimal in Section |6.3| . In section 
|6.4| we evaluate the performance of the optimal strategy in the contexts of quantum state 
discrimination, quantum state stabilization in the presence noise, perfect quantum tracking, 
and state-dependent quantum cloning. Section |6.5| proposes a physical implementation of 
our strategy in terms of closed and open loop control. Section ^]6| discusses generalizations 
of the problem and concludes. 



6.2 Problem and Strategy 

In this section we give a formal statement of the problem of interest and introduce our 
strategy. 

6.2.1 The Problem 

Formally, the problem we set out to solve can be stated as follows: 

Problem 1. Given qubit density matrices p\, pi, ~p\, ~p~2 (with p\ ^ P2) and probabilities tti, 
7T2 with 7Ti + 7T2 = 1, find a quantum operation C maximizing 

(5F HS ) = tti Tr [C^OpJ + tt 2 Tr [C(p 2 )p 2 ] . (6.2) 

We will refer to this as "the tracking problem" . 

The choice of the average Hilbert-Schmidt inner product as our figure-of-merit (3hs) is 
motivated by technical reasons (to be clarified later), and by the fact that for pure target 
states (the case of greater interest as far as applications are concerned), (^hs) is precisely 



This is in contrast with the traditional approach to quantum error correction, which followed the direction 



of adapting classical coding techniques to the quantum domain [172]. 

2 As done in the previous chapter, throughout we drop the index 1 from (?hs)i f° r ease of notation. 
However, we preserve the index HS because here we do not restrict to pure target states, hence the distinction 
between 3F HS and 2f is relevant. 
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equal to the average fidelity. When the target states are mixed, (^hs) is a lower bound to the 
average fidelity |77| . Although not as well motivated as in the case of pure target states, the 
determination of the quantum operation C maximizing (^hs) can still be useful for mixed 
target states. For example, if a certain application requires tracking to be performed with 
average fidelity / and the optimal value of our figure-of-merit is such that (^hs) > /, then 
C is suitable for the task. 

As a final remark, note that we do not exclude the case p l = ~p 2 from the statement of 
the problem. However, we will exclude the case p x = p 2 = 1/2 from the following analysis. 
Obviously, this particular transformation is always feasible and achieved with the completely 
depolarizing channel. 

Next, we propose a strategy that will later be proved to be a solution of this tracking 
problem. 



6.2.2 The Strategy 

In this section, we provide an analytical solution of the tracking problem, i.e., we detail 
the structure of an optimal tracking operation C and derive closed forms for the associated 
maximal value of the figure-of-merit (3hs)- The scheme proposed here was constructed by 
incorporating some features observed from the numerical solution of the tracking problem 
in an analytical optimization procedure. In the next section, we will show that the tracking 
problem can be cast as a semidefinite program (SDP) JTJ], pO|j , and will employ the theory for 
this type of optimization problem to prove that the strategy presented here actually solves 
the tracking problem. 

A quantum operation is a description of a certain physically allowed evolution of a quan- 
tum state. For a closed quantum system (not interacting with an environment) this descrip- 
tion is given by the familiar unitary evolution of Schrodinger's equation. For open quantum 
systems, unitary evolution alone does not account for every possible state transformation - 
in this case, the set of quantum operations is identified with the more comprehensive set of 
completely positive and trace preserving (CPTP) maps. 

Any one qubit CPTP map C can be decomposed as J64 , 55 1 



C(q) = UV(VqV*)rf , (6.3) 

where U, V are unitary matrices and T> induces an affine transformation on the input 
Bloch vectors; namely, it contracts the x, y and z components via a multiplicative factor, 
and subsequently adds a fixed number to them. Any non-unitary evolution arises from a 
transformation of this type. In the framework of Eq. ( |6.3|) , unitary dynamics is simply 
obtained by making the affine map T> redundant (e.g., multiplying by l's and adding O's to 
the x—, y— and z— Bloch components). 

In general, CPTP maps reduce the distinguishability of quantum states. On the Bloch 
sphere this typically corresponds to a reduction of the Bloch vector length and angles between 
vectors. In contrast unitary dynamics preserves the angles between Bloch vectors and their 
lengths. For the tracking problem, we can imagine that in some cases the optimal strategy 
will preserve lengths and angles, i.e. it will be some unitary correction. We will construct 
an "indicator function" which will flag this case. 
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6.2.2.1 Indicator function 

To gain some intuition, we start by constructing an indicator function for the simplest case 
of tracking with uniform priorities 7Ti = 112 = 1/2 from pure states (pi and P2) to pure states 
(p 1 and ~p 2 ). Throughout, 6 and 9 will denote the angles between the Bloch vectors of p±, 
P2 and ~p 1 , ~p 2 i respectively. It will also be convenient to define 9 and 9 to be the half-angle 
between the Bloch vectors, i.e., 29 = and 29 = 0. 

Given that all the states involved are pure, it is straightforward to conclude that if = 0, 
then unitary dynamics is the best choice — a suitable rotation of the Bloch vectors of the 
inputs can perfectly bring them to coincide with the Bloch vectors of the targets (as opposed 
to a non-unitary evolution that would decrease the angle, hence excluding the possibility of 
perfect tracking). 

A corollary of a theorem by Alberti and Uhlmann [ 171 (see appendix |U. 1| ) implies that 



any pure state transformation such that > 0, can be perfectly implemented. If that is 
the case, then this transformation must be non-unitary, since a unitary would not be able 
to bring the angles to perfectly match. This suggests the introduction of the function 

n : =e-e, (6.4) 

to indicate non- unitary dynamics whenever Q > 0. Next, we argue that Q < indicates 
unitary dynamics, thus establishing Q as an example of indicator function we were looking 
for. 

We have already seen that Q — implies unitary dynamics. Intuitively, this conclusion 
can be extended to Q < with the following reasoning. If < 0, any further decrease of 
the initial angle can only further separate the resulting states from the targets. Since there 
is not any quantum operation capable of increasing this angle, the best policy must be to 
preserve it, hence a unitary. 

The above discussion may suggest that the optimal indicator function is merely a com- 
parison of the distinguishabilities between sources and targets. If the sources are more 
distinguishable than the targets, then we employ a quantum measurement to decrease the 
distinguishability, hence approximating the targets. If the sources are no more distinguish- 
able than the targets, then we employ a unitary operation to avoid a further decrease of the 
overlap between the output states and the targets. Although this reasoning is certainly in 
agreement with the indicator function introduced above for the special case of pure states, 
it does not extend to mixed state transformations 0. 

If the states are not pure and the priorities are not uniform, it is much more difficult 
to understand how purities, angles and priorities combine to form a meaningful decision 
criterion about the nature of the best dynamics. In order to introduce an indicator function 
for this general case (obtained from some mathematical optimization procedure, not from 
an heuristic argument), we first define some useful notation. 

Let Ri be the Bloch vector of pi and Ri the "Bloch vector" of 'Kip i [or, more precisely, 
the Bloch vector of the normalized density matrix (1 — 7r»)l/2 + nipi]. Symbolically, for 



3 At least not as far as the optimization of the figurc-of-merit of Eq. ( |6.2| ) is concerned. A possibly 
interesting problem would be the determination of a figure-of-merit that would preserve such behavior for 
general state transformations. 
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x n 2 



TZ G {R, R}, define 

(6.5a) 
(6.5b) 
(6.5c) 

and as usual, the corresponding unbolded type gives the Euclidean norm 1Z{ + - tX y = ||72.{ + _ )X } | 
Also, the following will be important throughout 

2 

T := C 1 - Ri ■ R ^ ■ Kj) - ( 6 - 6 ) 



S'.= jT* + AR\{EtL-Rl). (6.7) 

We note that i?i ^ -R2 guarantees that R?_ — R\ > (see Appendix p.2.1| , Lemma p.l| ), 
hence both S and T are real numbers. 
In terms of these quantities, we define 

Q, := S + T-2R X R X , (6.8) 

with Q > indicating that non-unitary dynamics are required (which will be detailed as 
"procedure A") and Q < indicating that unitary dynamics ("procedure B") are required. 

Although it would be difficult to motivate the indicator function Q of Eq. fl6.8|) as we 
did with Q in Eq. (|6.4| ) , it is possible to see that the former is equivalent to the latter in the 
case of pure qubit states. This is shown in Fig. |6.2| , where it is also noticeable that even a 
simple generalization of the input states from pure to mixed states with the same level of 
mixedness (as measured by the norm of their Bloch vector R), is already sufficient to give a 
fairly non-trivial division line between the two types of dynamics. 

6.2.2.2 Procedure A 

In this section we present the details of the map C from Eq. ( |6.3| ) for Q > (which indicates 
non- unitary dynamics). 

Step 1. The rotation by the unitary V takes the two input Bloch vectors to vectors R[ 
and R 2 in the xz-plane in such a way that they share a common positive £c-component and 
R[ ■ z > R' 2 ■ z, explicitly 

B,Afli = |. + ^». (6.9) 

Step 2. The affine transformation T> shortens the x, y and z components of its inputs by 
multiplying them, respectively, by /ii, /x 2 , 7/3 with < ^{1,2,3} < 1 and subsequently adding 
s\ to the x component. Applied to R^ that reads 

R't ~ r: = L /is^^i . (6.10) 
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Figure 6.2: Lines separating unitary and non-unitary dynamics, as prescribed by the indicator 
function of Eq. ( |6.8| ). For source states, we consider pairs of mixed states with Bloch vector length 
R and separated by an angle = 28 (Bloch sphere angle). For target states, we consider pairs 
of pure states separated by an angle = 28. The fidelity between the source states (horizontal 
axis) is 1 — R 2 sin 2 8, and the fidelity between the target states (vertical axis) is cos 2 8. The region 
where unitary dynamics is advisable (0 < 0) is indicated with an arrow. The intuitive notion 
that a measurement is employed when the targets are less distinguishable than the sources (and a 
unitary, otherwise), only holds if R = 1 (pure sources). For R = 0.9,0.8 the division line moves 
down in such a way as to increase the portion of the parameter space where non-unitary dynamics 
is advisable ($7 > 0). 



That such a transformation can be physically implemented is not a trivial fact. Indeed, 
strict conditions involving the parameters /i{i,2,3} an d Si must be satisfied to guarantee the 
feasibility of transformation ( |6.10 ) as a CPTP map |65| . The following values can be shown 
to satisfy these conditions 



M2 



1^3 



Si 



S(S + T) 



-R X R X R_ 



2 X 7? ft 



S + T 



S(S + T) 



Rx R- 



2S(s + ry 



(s + Tf -ar\r 2 x 



(6.11a) 
(6.11b) 
(6.11c) 

(6. lid) 



In Appendix p.2.1| we show that the only circumstances under which the inequalities 
S > and S + T > are not simultaneously satisfied have fl = 0. Therefore, the quantities 
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above are real and well-defined for the present procedure (Q > 0). It is not difficult to check 
that /ii = ^2^3 and si = [1 — (1 — /i|), so the resulting map acting on density matrices 



is an extremal point of the convex set of CPTP maps |p5 



Remarkably, if the target Bloch vectors -Ri and R 2 are parallel or anti-parallel (i.e., 
R x = 0), Eqs. ( |3.11| ) simplify to //i = fi 2 = A*3 — an d si = 1. This implies that R" = x 
for i — 1, 2, or equivalently, that V outputs |+) = (|0) + |l))/\/2 independently of the input. 

Step 3. For R x ^ 0, the unitary U rotates the input vectors R" to lie on the plane 
determined by the target vectors, and within that plane by a suitable angle. For R x = 0, U 
simply rotates the vector x in order to align it with R + . In either case, U can be expressed 
as the following map 



R'j 1 — > R 1 ! 1 — ki\R\ + ki 2 R2 



(6.12) 



with 



k - — i 



a 2 + pipjR 2 x 



-ly+iaiPi-^Rr-R- 



a- 



"R 2 + + + (3 2 R 2 \\ 2 + 2a (fix - &)] i?' 



—2 

x 



(6.13) 
(6.14) 



where we have defined 1 = 2 and 2 = 1; and 



a = R, ■ x 



R x 



S + T 
2S 



S(S + T) 



Ri ■ R . 



(6.15) 
(6.16) 



With this, the figure-of-merit of Eq. ( |6.2p can be shown to be 

Pes) = 2 + y ' 



(6.17) 



where T a is obtained by substituting Eqs. (|6.15 ) and ( 6.16|) into Eq. ( p,14| ) and reads 



-2 2R 2 _R X 



(6.18) 



Fig. |T3] illustrates this sequence of transformations for the case R x ^ 0. 



6.2.2.3 Procedure B 



As pointed out before, if Q < then the affine transformation T> is not implemented and 
the product VU gives the unitary dynamics. In this case, V can be chosen precisely as in 
step 1 of procedure A. 
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Figure 6.3: (Color online) Bloch sphere schematics of procedure A. a) The source (target) 
Bloch vectors Ri and R2 (R\ and R2) determine the plane tt s (irt)- b) V implements the rotation 
transforming ir s to the xz-plane, in such a way that the vector R'± — R' 2 is parallel to +z. c) The 
map T> deforms the Bloch sphere into an ellipsoid of semi-axis 112 and ^3 and translates it by 
si along the x axis. The resulting ellipsoid touches the original Bloch sphere — a feature related 
to the fact that T> is an extremal CPTP map. d) U rotates the resulting states to the plane 7Tt, 
and within that plane by some angle such that the resulting states R'" and R 2 approximate R\ 
and i?2, respectively. 



For R x 0, we preserve the form of the transformation U from Eq. (|6.12 ), but the 
values of a and $ are given by 



a 



Ri ■ x = — — 



Pi -= 

K x 



R x 
R. 
Ri • R- 

R x R- 



(6.19) 
(6.20) 



For R x = 0, assume that the Bloch vectors Ri and R 2 are anti-parallel (this is without 
loss of generality, since parallel targets always exhibit Q > Q). In particular, take Ri 
parallel to z and R2 parallel to —z. The following transformation specifies U in this case 



where 



Ri 



RX Q 

— — cos V 
R— 



{Ri ■ R 
R~ 



sin •& 



x + 



Rx ■ 4 1 

— — sin v + 
R— 



Ri ■ R- 
R~ 



cos?9 



sin 



-Rx (Ri — R2) 



T 



R-\ R 



(6.21) 



(6.22) 



and cos$ = +\/l — sin 2 d. We note that d is a valid angle since the rhs of Eq. ( |6.22 ) is 
bounded between —1 and 1 0. 



To see this, note that if the targets arc parallel it is immediate that £1 = \T\ 

-2 



T and our claims holds if 
2 



T > 0. A straightforward computation shows that T = {R\ + R 2 f - {R{Rl + RiR" 2 + 2RiR 2 RiR 2 cos Q), 
which clearly achieves the minimum value iff i?i = R 2 — cosG = 1. Ho wever , this requires the sources to 
be identical, which is excluded from the statement of the problem in Sec. |6.2.1 . Therefore, T > 0. 



°This follows easily from the inequality R x < i?_ (Appendix D.2.1) and from R + 



\Ri-R 2 \ < \/rI 



T 



(the equality follows from the anti-parallelism of the target Bloch vectors and the inequality from the fact 
that T < for R x = and Ct < 0) 
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If R\ and R2 do not align along the z direction as specified above, we simply apply a 
further rotation that aligns the z axis with the direction R\jK\. 

In both R x 7^ and R x = cases, the average fidelity can be computed to be 

<3hs> = \ + y , (6-23) 

where Tb is obtained by substituting Eqs. ( |6.19| ) and ( |6.20|) into Eq. ( |6.14|) . After some 
manipulation we find 

R\ - T + 2R X R X . (6.24) 

Finally, let us note that a more compact and mathematical description of procedures A 
and B is provided in the Appendix p.3| . 



6.3 Optimality Proof 



In this section, we employ duality theory for SDPs to prove the following theorem: 



Theorem 6.1. Our tracking strategy (as described in Sec. \6.2.2\ ), implements optimal track- 
ing between any pair of source and target qubit states and is, therefore, a solution of the 
tracking problem introduced in Sec. \b\2.\ 



In the subsequent proof of this theorem, some familiarity with SDP theory is assumed. 
Standard reviews on the topic are |18|, |2~0 |. More closely related to our purposes is ||19|| , where 
the connection between optimization of quantum operations and SDPs was first noted. Also 
relevant is Ref. 0], where a similar technique was used to approach a particular case of 
tracking. 



6.3.1 The tracking problem as a SDP 

We start by showing that the tracking problem can be formulated as a SDP. Formally, it 
can be written as 



max Vlr [C{pi)ir iPi 

CgQ sct 

A 8 = 1 



It will be convenient to rewrite C(pi) as |49| 

C(fH) = Tn [{pi ® 

where £ is the (unnormalized) Choi matrix 

£= (J®C)(|^ + )(^ + |) 



(6.25) 



(6.26) 



(6.27) 



and \^ + ) = 1 00) + 1 11). Eqs. ( 6.26j ) and (|6.27| ) establish a one-to-one relation between the set 
of CPTP maps on qubits and the set of (unnormalized) 2-qubit density matrices satisfying 
Tr 2 £ = I2 HH HI ^0[ |5l| . Here, Tr 1 ( 2 ) denotes the partial trace operation over the first 
(second) qubit. 
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Using this isomorphism, a straightforward manipulation gives for the objective function 
in ( |6.25| ) the form — Tr (-Fo^), where 

2 

Fo = -Y,PJ® 7r iPi- ( 6 - 28 ) 

i=l 

whereas the constraint C G <2| ct becomes € > and Tr 2 €, = 1 2 . In conclusion, the tracking 
problem assumes the standard form of a SDP 

maximize — Tr (F €) 

subject to € > (6.29) 
Tr[(a k ® !)<£] = 25m, fc = 0, 1,2,3. 



A special feature of ( |6.29|) which will be explored next is that the replacement pi 1— > 
ApiA ] and p i i-> B^B, with A, B e U(2), yields another SDP (with F replacing F ) 
that achieves exactly the same optimal value. This can be easily seen by noting that: (i) 

Tr (F €) = Tr (P €j, for € = (A T <g> B)^(A J ® 5) and (ii) € satisfies the constraints in 
(|6.29| ) if and only if <£ does. 

6.3.2 The duality trick 



As shown in Eqs. ( |2.57| ) and ( |2.5<j| ), the strategy described in Sec. |6.2.2| can be written in 



terms of the Choi matrix as € = (V T <g> U)D(V T <g> U) ] , with 

£ = \t®l + ^t®X + ®X-^Y ®Y + ^-Z®Z , (6.30) 

£t £t £j —1 

where X, Y and Z denote the Pauli matrices. Given the reasoning of the previous section, our 
strategy constitutes an optimal solution to the tracking problem if and only if the following 
SDP is solved with £ = £>, 

maximize — Tr (^Fq&J subject to .ft > and Tr [(cr& <g> 1)$] = 25ko , A; = 0,1,2,3, 

(6.31) 

with 

2 

Fo = ~Yl ( V P^y ® U^iPiU . (6.32) 

i=l 

The above SDP has the strong duality property, i.e., its optimal value is guaranteed to be 
identical to the optimal value of its dual problem |20[| . This fact follows, for example, from 
the "strict feasibility" of the point & = 1® 1/2, which satisfies the constraints of ( |6.31|) with 
the strict inequality &> 0. 

From duality theory for SDPs (cf. Sec. [2.3.2|) , the problem above is solved with ft = 3D 



if and only if (i) 3D satisfies the constraints of ( |6.31| ) and (ii) the linear matrix inequality 

F = F + pol ® 1 + yiX ® 1 + y 2 F <g> 1 + y 3 Z (g> 1 > (6.33) 
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is satisfied by some quadruple (y , Ji, £3) such that 

2 f0 = - Tr (f D 



(6.34) 

If that is the case, then the so-called "complementary slackness" condition jnj, DF = 0, 
holds for the appropriate values of coefficients r. , Pi, £2 and r. 3 . 

To see that (i) is verified, recall that the values of ^1,2,3 and s\ were chosen to make of 
T> an (extreme) CPTP map. As mentioned before, the Choi matrix of any such map (on 
qubits) is characterized by the constraints of problem (|6.31|) . 

For (ii), first note that — Tr (^Fq£)^ is merely (Uhs) or (^hs) gi ven i n Eqs. ( |6.17[) and 
( |6.23|) , depending on whether Q > or Q < 0. However, for later use in Ch. [7], we will 
consider the more general case where 'p i is not necessarily normalized (our intention is to show 
that our tracking strategy is still optimal in this case). Accounting for this generalization, 
we have 

|(c + r a ) if n>o 



Tr F n S) 



c + iY) if n<o 



(6.35) 



where c := C\ + C2 and q := ^Tr/^ (for normalized Pj, we have q = 7Tj and c = 1). 

In our particular problem, the complementary slackness condition results in sufficient 
independent linear equations that £ , ji, j: 2 and j: 3 are defined precisely. We find p 2 = and 



If Q > 0, 







f = 






R x 


Pi = 


AR_ 




1 


?3 = 


4i?_ 



c + r a ) , 

(cjJii + c 2 i? 2 ) • H- 



(6.36a) 
(6.36b) 

(6.36c) 



If ft < 0, 



? = 


4 (C 




1 


Pi = 


4R_ 




1 


?3 = 


4i?_ 



c + r t 



(cifli + c 2 i* 2 ) • + — 

J- 6 



(6.37a) 
(6.37b) 

(6.37c) 



where, for brevity, we have defined 

2 



i=l 

£ '■= R X ~R, + R x Rt 



(6.38) 
(6.39) 
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which can be shown to satisfy the relation 



~ 2 + £ 2 



(6.40) 



We prove in Appendix p.2.2| that, although T a and Tf, appear in the denominator of some 
of the coefficients in Eqs. ( |6.36|) and ( |6.37|) , no singularities occur if the indicated range of 
fl is observed. 

With the set of coefficients ( B.36j ) and (|6.37|) , Eq. (|6.34|) is clearly satisfied. As a re- 
sult, the optimality of our tracking strategy is solely dependent on proving the linear matrix 
inequality F > for the above set of coefficients. In Appendix p.2.3| we study the char- 
acteristic polynomial of F and conclude that all of its roots are non-negative, thus proving 
Theorem 16.11. 



6.4 Examples 

In this section we evaluate our tracking strategy at work in some physically relevant problems 
such as quantum state discrimination, quantum state purification, stabilization of quantum 
states in the presence of noise and state-dependent quantum cloning. Moreover, we also 
discuss the application of our strategy in circumstances where tracking is known to be 
perfectly achievable. The analysis presented in this section is meant to give an explicit 
account on the wide range of physical applications of the tracking problem and its optimal 
solution. 



6.4.1 Quantum State Discrimination 

A standard result in quantum state discrimination is the Helstrom measurement |82| , which 
consists of a projective quantum measurement that maximizes the probability (PHcist) of 
correctly identifying the state of a quantum system that could have been prepared in two 
different states. Describing the possible preparations by p\ with probability pi and pi with 
probability p 2 , the Helstrom measurement gives 



Heist 



\ + UpiPi -PipiWtr, (6.41) 



where || • || tr denotes the trace norm. 

In this section, we propose a quantum state discrimination protocol for a pair of qubit 
states based on the tracking strategy introduced in Sec. |6.2.2| . We will show that it is 



equivalent to Helstrom's strategy, as it will give the same correct identification probability 
ofEq. qOip . 

Our quantum state discrimination protocol consists of two simple steps: First we apply 
an optimal tracking operation C to approximate the states to be discriminated to some pair 
of orthogonal states. Without loss of generality, we take pi = |0)(0| and p 2 = |1)(1|. The 
priority of each transformation is taken to be identical to the prior probabilities with which 
pi and p2 are prepared, i.e., 7Tj = pj. As the second and final step, we perform the quantum 
measurement {|0)(0| , |1)(1|}, under the understanding that an outcome '0' suggests the 
preparation to be pi and an outcome T suggests p%. 
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The probability of a correct identification under this tracking scheme is given by Born's 
rule, averaged with the prior probabilities, 

Ptrack = Pi Tr [C(pi) |0)(0|] + p 2 Tr \C{p 2 ) |1)(1|] 

= m Tr [C(pi)ft] + vr 2 Tr [C(p 2 )p 2 ] . (6.42) 



By comparing Eqs. ( |6.42|) and ( |6.2| ), one promptly recognizes that Ptrack is precisely the 



performance of the operation C for tracking from to p { with priority 7Tj, as measured by 
(3"hs)- Hence, in the case of p 1 = |0)(0|, p 2 = |1)(1| and 7Tj = p iy Eqs. ( |6.17D and ( |6.23D give 



the probability of success of our discrimination scheme for fl > and Q < 0, respectively. 
Next, we make these formulas more explicit. 

Using the condition P x = in Eqs. (|6.7|) and ( |6.8|) , we obtain Q = \T\ + T. Essentially, 
this means that T assumes the role of the indicator function: if T > 0, then Q > and 
we employ procedure A; if T < 0, then Q = and we employ procedure B. Substituting 
Ri = —{ — l) l piZ into Eq. (|6.6|), some simple algebra gives 



T = (P1-P2) 2 - \\piRi - P2R2W 2 , (6.43) 

and we can write 

P - j | + |bi-P2| if T>0 , s 

Ptrack " U + ilbi^ - V2R2W if t < ' (6 - 44) 



where the first line follows from Eq. ( |6.17| ) and the second from Eq. ( |6.23p 



It is a tedious exercise (essentially the computation of the eigenvalues of pipi — p 2 p 2 ) to 
re-express Eq. ( |6.41| ) in terms of the Bloch vectors R4. The result is exactly 



Heist 



Ptrack , (6.45) 



hence establishing the claimed equivalence between our strategy and Helstrom's. 

Note that if T > 0, Ptrack is independent of the states we are trying to distinguish, but 
merely dependent on the probabilities with which they occur. This can be understood by 
looking at the details of the affme operation taking place in procedure A. As noted before, for 
R x = (as is the case for orthogonal targets), the affine map is such that p\ — p 2 — A*3 = 
and Si = 1; that is, the source states are completely depolarized and a new state |+) is 
prepared instead. Next, this state is rotated by the unitary U and the measurement is 
finally performed. 

It is easy to see that for pi = p 2 = 1/2 the condition T > (procedure A) never holds. 
However, as we deviate from the uniform distribution, the volume of the parameter space 
where procedure A is recommended grows to fully cover the space when pi — or p\ = 1. 
This is shown in Fig. |6.4|. 



6.4.2 Quantum state Purification 

In this section, we consider a kind of state purification task where we aim to transform a pair 
of mixed source states p\ and p 2 with the same degree of mixedness (Pi = R 2 = R < 1) that 
are separated in the Bloch sphere by an angle 29 e (0, it] into a pair of pure target states ~p x 
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(a) Pl < 1/2 (b) Pl > 1/2 



Figure 6.4: (Color online) Each color of sheet represents a fixed deviation from the uniform 
probability distribution. The sheets divide the parameter space in two regions. The arrows desig- 
nate the regions where procedure A (non-unitary) is recommended. On the other side, procedure 
B (unitary) is recommended. The larger the deviation from p\ = 1/2, the larger the region where 
a non-unitary preparation for the measurement {|0)(0| , |1)(1|} is advisable. 



and J> 2 separated by the same Bloch sphere angle 29. In other words, our purification task 
consists of elongating the Bloch vectors while preserving the angle between them. 

For later use, it will be convenient to derive formulas for the indicator function and 
figure-of-merit of a slightly more general problem, where the angle between the target Bloch 
vectors is 29 G [0, ir]. The purification task can be recovered by restricting 9 = 9. In addition, 
we will allow the priorities of the transformations pi — > 'p i to be arbitrary positive scalars 7Tj 
such that 7Ti + 7T2 = 1. Later, we make 7Ti = 7r 2 , in order to simplify the formulas. 

In this generalized purification framework {9 ^ 9), the indicator function is obtained 
from Eq. (|6.8|), by incorporating the conditions Ri = Hi (purity of the targets) and Ri = R 
(common mixedness of the sources) in the expressions for R X R X , T and S, from Eqs. ( |6.5c|) , 
( |6.6| ) and (|6.7|), respectively. These have a particularly appealing form: 



R X R X = ^TZ S TZ C [n + n_ - 5} , (6.46a) 
T = (1 — TZ C )U + - U s Ii_ , (6.46b) 



s = ^[(i- n c )u + + n s u_Y - A5n s (i - n c ) , (6.46c) 

where we have defined 7Z C := R 2 cos 2 9, 1Z S := R 2 sin 2 9,5= {tx\ — 7r 2 ) 2 and 

n± := vr 2 + 7T 2 ± 27Ti7r 2 cos 29. (6.47) 

From the above equations, the indicator function and the figure-of-merit can be immediately 
obtained. At this point, though, we specialize to the case 5 = (i.e., 7Ti = 7r 2 = 1/2) and 
give explicit formulas in this particular case. From Eq. ([fT 

Q = 



1 - TZ C )U + - y/KsKJL+n^ (6.48) 
2 [cos 2 9 - R 2 cos 9 cos 9 cos (9 -9)] , (6.49) 
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Figure 6.5: Purifying a pair of mixed states with Bloch vectors of length R < 1, separated by 
an angle 29 S (0, tt] with priorities tti = iT2 = 1/2. The plot shows the optimal average fidelity 
for different values of R and the control parameters s± and ^{1,2,3} f° r R = 0.6. The purification 
procedure attempts to increase, as much as possible, the length R while preserving the angle 9. 



where, in the second line, we used that I1 H 
From Eqs. ( pT7| ) and QET23D , 



cos 



2 9 and n. 



sin 2 6 when 7Ti = 7r 2 = 1/2. 



(^HS) 



cos 2 9 + 



if n > 



-/? 2 cos 2 6» 

i#cos(0-£) if < 



(6.50) 



It is now straightforward to see that, if 9 = 9, then Q > with saturation if and only if 
9 = 7r/2. That is, unless we are trying to purify from antipodal mixed states to orthogonal 
states, the best strategy is always a non-unitary transformation (procedure A). The optimal 
average fidelity of the purification scheme can be obtained by using 9 = 9 in Eq. ( |6.50| ). 
The resulting optimal purification performance is shown in Fig. |T5] and corresponds to the 
best achievable average fidelity allowed by quantum mechanics to the purification problem 
at hand. 

From Fig. |6.5| , we see that for small 9, (3hs) is typically high, regardless of the length 
R. This can be understood in analogy to the fact that collapsing a set of mixed states into 
a single pure state is always perfectly achievable. In fact, such a collapse is nearly what is 
needed in this domain, since a pair of pure target states separated by a small angle can be 
well approximated by a single pure state. Fig. IOI confirms this reasoning by showing that, 



in the small 9 domain, the source Bloch vectors are strongly compressed due to the small 
values of /i{i,2,3} and then strongly elongated due to the large value of si. 

For increasing values of 9 the fidelity decreases. Such a decay is accentuated if the degree 
of mixedness of the source states is high (small values of R), reflecting the intuitive idea 
that it is harder to purify very mixed states. In these intermediate regions, a non-trivial 
combination of compressions /i{i,2,3} and translation s\ of the Bloch vectors forms the optimal 
purifying scheme. Noticeably, the optimal procedure has less effect on the qubit (decreasing 
Si and increasing //{i^^}) as 9 increases. 
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At 9 = 7r/2, we have Q = and an optimal unitary transformation is actually to do 
nothing (the unitary transformation V is undone by another unitary V^, see Sec. 6.2. 2. 3| ). 
Note that although the plot of fii and fi2 in Fig. |6.5| approaches a constant value in between 
and 1, the vanishing indicator function introduces a discontinuity in the purifying operation, 
since now we should use procedure B, hence /ii = /i 2 = 1 at 9 = ir/2. Nevertheless, the 
values of fii and /12 are utterly irrelevant in this case. At this stage both Bloch vectors are 
aligned with the z direction, thus any compression along x and y cannot affect the states 
of interest. 

6.4.3 Stabilizing pure states 

A possible use for tracking is to try to cope with the presence of noise in quantum computa- 
tion and communication involving qubits. In general, noise processes (we restrict ourselves 
to CP processes) cannot be inverted by another CP map, not even when the noise is per- 
fectly known (]. However, instead of stabilizing the full Bloch sphere against noise, one may 
be interested at stabilizing only a limited number of states. Although not perfect, it is not 
uncommon that good stabilization can be achieved within this framework. 

In this section, we consider a quantum error correction task of this type, which was 
studied in detail in Ref. ||. We will show that the optimal correction scheme is merely a 
particular case of the quantum state purification procedure (with 9^9) introduced in the 
previous section. 

Assume that Alice prepares (with equal probabilities) a qubit in one of the non-orthogonal 
pure states 

) , (6.51a) 

) , (6.51b) 

where |±) = (|0) ± |l))/v / 2 and 9 is the half-angle between l^) and \ip2) i n the Bloch sphere 
representation, hence 9 G (0,7r/2). She then sends her qubit to Bob through a dephasing 
channel 

S[s)=pZqZ + {l-p)q, (6.52) 

where p is a constant in the range (0, 1/2] that has been previously determined. Bob, who 
does not know which of the two states was prepared, has to apply a quantum operation so as 
to ensure that, when Alice performs a check-measurement {\ipk){ipk \ , 1 — l^fcXV^I} (with k 
labeling the identity of her actual preparation) on Bob's output, the probability of detecting 
her original preparation is as high as possible. This probability equals the average fidelity 
between the possible inputs and the outputs of Bob's operation. 

Our tracking strategy can be of assistance to Bob if he regards the two possible noisy 
states as the source states pi = E{\ipi){ipi\) and tracks (with equal priorities i\i = 1/2) to the 



9 9 

l^i) = COS 2 1+) + sin 2 

W2) = cos - |+) — sin - 



6 This follows from the semi-group structure of CP maps. An obvious exception arises by restricting to 
the group of unitary noises. In fact, a theorem by Wigner states that this is the only exception (see [173| 
for a proof, see also [174 1 ) 
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target states J> i = In this case, the target states are pure and the source states have 

the same degree of mixedness [this follows easily from the application of the dephasing map 
to the states of Eq. (|6.51|) 1, which is precisely the scenario we considered in the last section 
for quantum state purification. 

The indicator function Q can then be obtained from Eq. ( 6.49|) by using the following 



identities for the angle 9 (recall that 9 is the half-angle, in the Bloch sphere, between the 
states output by the dephasing noise), 

sinfl = and cos 9=- ^ , (6.53) 

R R 

where R is the length of the noisy Bloch vectors. Explicitly, 

Q = 2 cos 2 9 [1 - R 2 + 2p sin 2 9\ . (6.54) 

It is easy to see that, given the ranges 9 G (0, 7r/2) and p G (0, 1/2], we have Q > 0, which 
implies that Bob should always apply the non-unitary procedure A. The optimal performance 
is then obtained by substituting the identities (|6.53|) in the first line of Eq. (|6.50| ), which 
gives 




W'^^H' ^.d^. ' (6 ' 55) 

As expected, this is precisely the optimal fidelity found for this problem in ||. 

It should be clear that our tracking strategy can be similarly applied to the stabilization 
of quantum states different from those of Eq. ( |6.51| ), prepared with non- uniform prior prob- 
abilities and undergoing noise dynamics different from dephasing, in any case still providing 
optimal stabilization. It thus represents a significant extension of the results in H. 



6.4.4 Perfectly tracking quantum states 

In this section we evaluate the performance of our strategy in circumstances where tracking 
is known to be perfectly achievable. It will be convenient to split our analysis in two, namely, 
the case of two pure target states and the remaining cases (in which at least one of the target 
states is mixed). 



6.4.4.1 Pure target states 

In appendix p.l| we prove a corollary of Alberti and Uhlmann's theorem stating that a CPTP 
map A perfectly transforming a pair of quantum states pi (i = 1, 2) into a pair of pure states 
Pi exists if and only if p^ are also pure and 9 > 9. Since our tracking strategy is optimal 
(cf. Theorem |6.1|) , we can infer from Alberti and Uhlmann's theorem that it implements 
tracking with unit fidelity whenever R = 1 and 9 > 9. This is explicitly verified in the 
sequence, where the indicator function and the figure-of-merit for pure state transformations 
are computed. 
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We start using Eq. ( |B.46j) with R = 1 (pure source condition) to construct the indicator 
function Q from Eq. fl6.8|) . After some straightforward manipulation, we obtain 

Q = 8^712 sin9 cos 9 sin (0 -9) . (6.56) 

For our purposes, the only meaningful feature of Q is whether it is strictly positive or not, 
in which case the above expression is equivalent to 

U = 2 (0 - 9) , (6.57) 

since 8, 9 e (0, tt/2] and 7Ti,7T2 G (0, 1). Recall that Q is the indicator function obtained in 
Sec. |6.2.2.1| , Eq. (|6.4|), via an heuristic argument. 

The figure-of-merit, in turn, can be obtained from Eqs. ( |6.17 ) and ( |6.23|) to be (3hs) = 1 
(if Q > 0) and 

1 1 



(9hs) = g + 2 V ^ + ^ + 2711712 cos ( 29 ~ (6 - 58) 

(if fl < 0). Note, however, that (3hs) = 1 if # = # (i.e., Q = 0), in such a way that we can 
write 

< fts > = {<4> if °e~<e ' (659) 

The first line of Eq. ( |6.59|) is exactly the content of Alberti and Uhlmann's theorem ap- 
plied to pure state transformations, whereas the second line establishes the optimal achiev- 
able average fidelity when perfect pure state transformation is impossible. 

In conclusion, besides representing a construction of Alberti and Uhlmann's map A 
for perfect pure state transformations, our tracking strategy also gives the unitary map 
(procedure B) that optimally approximates impossible pure state transformations. 



6.4.4.2 Mixed target states 

The requirement of perfect tracking does not restrict the target states to be pure. In fact, 
the more general form of Alberti and Uhlmann's theorem states that for any given target 
states Pj, there exists a CPTP map A that implements perfect tracking from all source states 
Pi satisfying 

||pi-^ 2 ||tr < \\pi-tp 2 \\tr (6.60) 

In contrast to the previous section though, our tracking strategy is generally not a construc- 
tion of the map A in this case. As mentioned before, this is a consequence of the fact that 
our figure-of-merit is not as well motivated in the case of mixed target states. For example, 
in situations where perfect tracking is possible, the resulting average Hilbert-Schmidt inner 
product does not achieve its maximal value. This is further explored next. 
Any CPTP map C implementing perfect tracking must satisfy 

7Ti Tr [C(pi)Pi] + 7r 2 Tr [C(p 2 )p 2 ] = tti Trp^ + ix 2 Tr pi . (6.61) 



Our strategy, though, does not arise from an attempt to enforce Eq. (|6.61|) , but instead to 
maximize its lhs (cf. Sec. |6.2.1| ). Although these actions are equivalent in the case of pure 
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Figure 6.6: (Color online) Although perfect tracking pi — > 'p i is physically allowed, the resulting 
average Hilbert-Schmidt inner product [the rhs of Eq. ( |6.61| )] is only 0.82 for this transformation. 
Our tracking strategy C attempts to maximize this number, finding a different transformation which 



gives an average Hilbert-Schmidt inner product [the lhs of Eq. (6.61)] equal to approximately 0.89. 
As a result, C does not implement perfect tracking. 



target states [the rhs of Eq. (|6.61| ) equals 1, which is precisely the maximum value of its 
lhs for states satisfying the criterion of Eq. ( |6.60| )1, for mixed target states this equivalence 
is lost. In this case, the lhs can typically be made greater than the rhs by employing an 
operation C that elongates the source Bloch vectors to nearly pure states, as illustrated in 
Fig. |6.6| . As a consequence, the maximization of our figure-of-merit leads to a departure 
from the perfect tracking operation. 

Yet, recall that the average Hilbert-Schmidt inner product lower bounds the average 
fidelity and as such, its maximization has some beneficial impact in implementing tracking, 
in the sense that it ensures that the resulting average fidelity is no less than the maximal 
average Hilbert-Schmidt inner product. 



6.4.5 State-dependent Cloner 



One of the most celebrated results in quantum information science is the "no-cloning theo- 
rem" ||165| , 166 1, which establishes the impossibility of copying an unknown pure quantum 
state. Since its inception in the literature, a lot of work has been done on the topic, both 
extending its range of applicability as well as attempting to weaken its impact in practical 

for a review) 



applications (see |175 



Remarkable results in each of these directions are the 



"no-broadcasting theorem" for noncommuting mixed quantum states ||167|| and the Buzek 
Hillery optimal quantum cloning machine [|1 76f . 



In this section we consider a state-dependent cloning task introduced in Ref. 



We 



will show that our tracking strategy provides a straightforward derivation of the optimal 



cloning fidelity obtained in that paper. Following ||177 |, let 



| a) = cos</> |0) + sin0 |1) 
1 6) = sin0 |0) + cos0 |1) 



(6.62a) 
(6.62b) 



for (f) G [0, 7r/4), be the only two possible preparations of a single-qubit, each of which 
occurring with probability 1/2. The cloning task is to output the two-qubit state \aa) = 
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| a) (8> | a) if the initial preparation is \a) or \bb) = |6) <g> \b) if the initial preparation is \b). In 
|177| , a unitary transformation U was obtained such that the figure-of-merit (the so-called 
"global fidelity") 

F g = 1 (|(aa|£/|a0)| 2 + \(bb\U\bO)\ 2 ) (6.63) 

is maximal. 

The key point that allows the application of our tracking strategy here is that, although 



the unitary evolution U acts on the Hilbert space of a two-qubit system, it was shown in ||177| , 
Appendix B] that the maximizing U is such that U \ a0) and U \ b0) lie in the two-dimensional 
subspace spanned by {\aa) , \bb)}. Therefore, we can regard this cloning as a transformation 
from the two-dimensional subspace spanned by { |a0) , \b0)} to the two-dimensional subspace 
spanned by {\aa) , \bb)}. By this same argument, we could have even relaxed the condition 
that the system to be cloned is a qubit. 

Let |si) and |s 2 ) (\t±) and 1^)) be the fictitious qubit source (target) states, and let 29 
{29) be the Bloch sphere angle between them. Then, we must have 

(si\s 2 ) = (aO\bO) = sin (20) = cos#, (6.64a) 
<*i|*a> = (aa\bb) = sin 2 (20) = cos#. (6.64b) 

From the above equations, the angles 9 and 9 can be computed in terms of 0, and the 
optimal value of F g is given by the optimal fidelity for tracking between pure qubit states, as 
described in Sec |6]4 A. In particular, note that for the present problem, a valid indicator 



function is the one proposed in Eq. ( |6.57| ) 



Vt = 2 arccos [sin (20)] — 2 arccos [sin 2 (20)] < 0, (6.65) 

where the inequality holds for the specified range of 0, implying that the optimal fidelity is 
given by Eq. ( |6.58[ ) with the proper values of 9 and 9, explicitly 



(Sks) = \ + \ + n% + 2n 1 n 2 cos n . (6.66) 

For it i = 7i2 = 1/2, the above formula can be shown to be precisely the same as Eq. (38) of 
[ 177 1, which gives the optimal global fidelity of the doner. Thus we have not only reproduced 



that previous result, but also determined how it is optimally modified to incorporate an 
unequal probability of preparation of \a) and \b). 

Finally, let us just mention that the resulting optimal tracking unitary operation (call 
it W) is not quite the optimal cloning unitary operation U appearing in Eq. ( |6.63| ) and 
detailed in |L77j (U and W do not even act in Hilbert spaces of equal dimensions). Instead, 



W constrains how U acts on the states of the form |-0O), but to fully specify U we would need 
to choose U 1 11) and U |01) such that U is a unitary matrix. Since this choice is not unique 
and does not affect the fidelity, we can say that W contains all the essential information 
associated with the optimal cloning map. 
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6.5 Tracking with a Control Loop 

Although the strategy introduced in Section |6.2.2| has been tailored to correspond to a CPTP 
map, so far no insight on how such a map can be physically implemented has been given. In 
this section we provide a realization in terms of a quantum control scheme. Namely, proce- 
dures A and B are shown to have the structure of closed and open loop control, respectively. 

We start by giving a possible Kraus decompositions for the CPTP maps representing our 
strategy. This is relevant here because the Kraus form of a CPTP map enables us to interpret 
that map as some generalized quantum measurement (with no record of the outcomes) j21 |. 



For Q > 0, the transformation C(pi) = UD(VpiV*)W from procedure A can be written as 
C{ Pi ) = (UM 1 V)p i (UM 1 V)^ + (UYM 2 V) Pl (UYM 2 vy , (6.67) 

with 

M l = cos |+X+| + sin |-)H , (6.68a) 

M 2 = sin (?^L) \+)(+\ - cos (?^±V) |-)H , (6.68b) 



where x an d V are defined such that sinx = p 3 , cosx = a/1 — /U3, sin r] = p 2 and cos r] = 

For Q < 0, the transformation C(p«) = UV P iV^W from procedure B is automatically in 
Kraus form, with a single Kraus operator UV. 

We interpret these results as follows. First for Q > 0, the unitary V is applied to 
the system and then a generalized quantum measurement with operators Mi and M 2 is 
performed. Conditioned on observing the outcome '2', a Pauli Y is applied to the system, 
followed by the unitary U. If the outcome is '1', the unitary U is applied straight away. 
Due to this measurement-dependent dynamics (feedback), procedure A can be regarded as 
a closed loop control scheme. 

Note that the measurement operators Mi and M 2 are not projections, so the implemen- 
tation of such a measurement requires the enlargement of the Hilbert space (by interaction 
with an ancilla), with subsequent (projective) measurement of the ancilla. Fig. |0] shows a 
possible circuit model for procedure A. 

For Q < 0, there is clearly no measurement involved, hence the control strategy is 
implemented independent of acquiring extra information from the system. For this reason, 
procedure B can be regarded as an open loop control scheme. 



6.6 Discussion and Conclusions 

In this chapter we have introduced a simple quantum version of a common classical control 
problem named tracking. Our quantum tracking problem consists of determining how to 
optimally enforce a certain dynamics to a qubit system, when the initial preparation of the 
qubit is uncertain (as modeled by a pair of states occurring with given prior probabilities) 
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Figure 6.7: A circuit model illustrating the feedback structure of procedure A. In the figure, 
|±i) = (|0) ± i|l))/\/2 are the eigenvectors of the Pauli matrix Y, H is the Hadamard gate 
and Zq = exp (— i9Z/2). The highlighted circuit entangles the main system with the ancilla and 
projectively measures the ancilla in the basis {\+i) , \— i)}- This induces a non-projective dynamics 
of the main system, and for this reason this block is referred to as a "weak measurement". If the 
measurement outcome is then the unitary transformations Y and U are applied to the main 
system; otherwise, only U is applied. 



and the desired dynamics depends on the actual preparation. We presented an optimal 
quantum tracking strategy. 

The tracking problem studied here is sufficiently general to provide an unifying approach 
to many problems in quantum information science as special cases. For example, some cases 
of quantum state discrimination, quantum state purification, stabilization of qubits against 
noise and state-dependent quantum cloning were explicitly shown to be instances of quantum 
tracking. As such, previously known quantum limits in the realization of these tasks were 
recovered via the application of our tracking strategy. Likewise, our tracking strategy can 
be used to obtain new and improved limits in the realization of other impossible quantum 
machines. 

The derivation of our strategy was largely dependent on the fact that our figure-of-merit 
(the averaged Hilbert-Schmidt inner product) is linear in C, which, in turn, is constrained 
to be an element of the convex set of CPTP maps acting on qubits. This implies that the 
optimal map C belongs to the subset of extreme points, which has been fully characterized in 
|65| . Thanks to a parametrization of these extreme points, the resulting optimization prob- 
lem could be handled analytically when a few mild assumptions (supported by numerical 
observation) were made about the form of the optimal solution. The optimality was safe- 
guarded a posteriori via an argument based on the SDP structure of the tracking problem. 

Analytical solutions for generalizations of the tracking problem studied here (e.g., other 
figures-of-merit and/or larger dimensional quantum systems) seem to require a modified ap- 
proach from the one adopted here. For example, had we chosen to proceed with a better 
motivated figure-of-merit for mixed targets, such as the average fidelity, we would still have 
the guarantee that the optimal C is an extreme point, however optimality results about a 
possible guess would be harder to derive, since it is not known if/how the resulting optimiza- 
tion problem can be cast as a SDP when source and target states are mixed. Alternatively, 
we could have chosen, for example, to minimize the average trace distance, which can be 
cast as a SDP ||131| , |135|| . However, the trace distance is not concave in C, in which case 



its minimum need not be an extreme point. Finally, had we kept our linear figure-of-merit 
but generalized from qubits to qudits for d > 2 (or to multiple qubits), we would face the 
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problem that the extreme points of the set of CPTP maps on higher dimensional matrix 
algebras are not well characterized. 

A possibly simpler generalization is to preserve low dimensionality of the quantum system 
and linearity in the figure-of- merit, but allow for a larger number of possible sources and 
targets. In principle, this problem can be approached following exactly the same lines as 
adopted here. In fact, it is not difficult to see that a particular case of this more general 
problem can already be considered solved given the results of this chapter. Consider we are 
given two sets Si and S2, respectively with rii and 712 elements (let N = ni + 72,2), of qubit 
density matrices Tj (j = 1, . . . , N), and want to send every element of Si to p { for i = 1, 2. 
In analogy with Eq. (|6.2j) , define the figure-of- merit 

ni N 

(%s) = ^g J Tr[C(r J ),p 1 ]+ £ Qj Tr [Cfo), p 2 ] , (6.69) 

j=l j=rn+l 

where the positive numbers qj set the priorities of each transformation, and Ylf=i Qj = 1- 
Due to the linearity of the trace and of quantum operations, Eq. ( |6.69| ) can be rewritten 
exactly as Eq. (fTD with m = Qj, ^2 = E^n 1+ i Qj> 



j ni ^ N 



J=l j=ni+l 



Note that 7Ti, 7r2 > 0, 7Ti + 7r2 = 1 and pi, p 2 are valid density matrices. So, for % — 1, 2 and 
j = 1, . . . , JV, the problem of optimally approximating the iV-state transformation Si — > p« 
with priority qj is equivalent to optimally approximating the 2-state transformation pi — > p^ 
with priority 7Tj. 
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7.1 Introduction 



So far in this thesis, we have looked at the problem of transforming quantum states with a 
single controlled intervention. In the last two chapters, we have seen how this can be useful 
to stabilize the unknown state of a quantum system undergoing some pre-characterized noisy 
dynamics. Our strategy consisted of waiting for the system to experience all the noise, and 
only after that to apply a quantum operation that optimally transformed the noisy states 
into the original states (or, in the case of tracking, to other desired states). In this chapter 
we ask whether we can do any better if instead of waiting for the full noisy evolution to take 
place, we actively interact with the system multiple times while the noise is still in action. 



This is illustrated in Fig. 7.1 
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Figure 7.1: Schematic of the multi-step tracking problem. Each block represents a con- 
trolled intervention that attempts to optimally track between sequences of density matrices as the 
system travels through the noisy channel £^ N ~^ 0...0 £^ o f W 



In classical systems, a multi-step setting as the one of Fig. [Fj] is the basic setup for 
discrete time feedback control, and is known to be a very effective scheme. Each intermediate 
step estimates the current state of the system, which is then suitably modified on the basis 
of this information. Moreover, in situations where the forthcoming noise is known, we can 
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use the information from the state estimation step and the information about the future 
dynamics in order to make the system more resilient against the noise to come. 

The same ideas apply for quantum systems, in this case, however, one has to consider that 
quantum measurements typically introduce noise on the state being measured. Because of 
this, it is generally not clear whether the application of multiple measurements is detrimental 
or beneficial for the control of quantum systems. 

In this chapter, we attempt to approach this question by looking for a sequence of con- 
trollers . . ., that provides an improved performance than that obtained in the 
case of an optimal single controller at the end. Clearly, the latter is recovered if the N — 1 
first controllers are equal to the identity map and the last controller is optimally chosen. 
Our problem is to decide whether a variation of this sequence exists (and how it can be 
constructed) such that the tracking task can be accomplished with higher average fidelity. 

The results of this chapter are preliminary and are presented in Sec. |7.2| , which is divided 
as follows: We start introducing the basic principle of dynamic programming and outlining 
how it can be used (along with an heuristic argument) to construct a multi-step tracking 
scheme assuming that an analytical optimal solution for the single-step tracking problem is 
known. In subsections [7.2. 1| and [7.2.2| these ideas are put in practice to approach the general 
case of tracking between arbitrary sequences of density matrices in N steps. In Sec. [7.2. 3| , 
we restrict to multi-step tracking for pairs of qubit states in an arbitrary number of steps 
and provide some numerical results for the case N = 2. 



7.2 Multi-step Tracking via Optimal Single-step Track- 
ing 

In this section we describe how to generate a "good" sequence of operations for the multi-step 
problem. Due to some simplifying assumptions to be made along the way, these sequences 
cannot be guaranteed to be optimal. Nevertheless, they are usually "good" in the sense 
that, in many cases, they produce higher fidelities than those obtained with a single optimal 
controller at the end. 

We start with a broad description of the method, which essentially consists of recursive 



applications of the following fundamental idea from dynamic programming |1 78| , |1 79 



Principle of Optimality. In an optimal sequence of controllers, whatever the initial state 
and the optimal first control may be, the remaining controls constitute an optimal control 
sequence with regard to the state resulting from the first control. 

We shall divide our analysis in two parts. First we look at the sequence of controllers from 
the end to the beginning. The application of the principle of optimality in this backwards 
direction — surmounted with an heuristic argument to be explained in the next section 
- will reveal each controller of the multi-step sequence as a single-step operation from 



certain source states pf 1 to certain target states ~g\ n \ as illustrated in Fig. |7.2| . Furthermore, 
assuming that the analytical solution to this problem is known, we can obtain the explicit 
form of the function 6, relating the target states of the n-th correction with the source and 
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Figure 7.2: Multi-step tracking as a sequence of single-step tracking operations. Each controller 
is seen as an optimal tracker from states pf 1 ^ to ~g\ n ^ . 



target states of the next correction, i.e.. 

_(n) 

Qi = 



6 [Pi iQi 



(7.1) 



N — n iterations of the above relation implicitly define a function s such that 

■JP) _ ( Jn+l) ( (n+2) (n+2) 
Qi - 6 [Pi i 6 [Pi ,Qi 



(n+1) (n+2) (TV) 
3 \ Pi i Pi 1- • • iPi J Pi 1 > 



(7.2) 



which shows that the target states g\ n ^ are expressed as a function of the (so far undeter- 
mined) source states at each step. 

The states pf 1 ^ are determined in the second part of our approach, in which the control 
sequence is studied in the standard forward direction. Here, the analytical solution of the 
single-step tracking problem will yield a function / such that 



(n) . ( (n-l) -(n-l) 

Pi-flPi ,Qi 



(7.3) 



Just as before, n — 1 iterations of this relation induce the function j , relating pf 1 ' with the 
given source density matrices p[ = pi and the targets qI' '"' n x ' 



(n) ^ / _(1) -("-!) 

p\ -7 [Pi, el' ,...,q\ 



(7.4) 



Together, Eqs. ( |7.2| ) and ( [7.4|) specify every and of 1 ' , and hence a sequence of iV 
single step controllers. In the following, the procedure described above is explicitly applied 
and the heuristic argument giving rise to Eq. (|7.1| ) is explained in detail. 



7.2.1 Backward direction 

Consider the multi-step sequence below 



— CW — e W — £(W-2) _ C (JV-1) _ £(N-1) _ C (N) — 



Whatever the sequence of noises £^<-' N ^ and optimal controllers C^''"' N ^ are, the initial 
density matrices pi are obviously transformed into other density matrices after experiencing 
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the action of the operations in the dashed box. Following the convention set up in Fig. 7.2 , 
we shall denote by the output of this sequence of operations. It then follows from the 
principle of optimality that is the optimal single-step operation for the transformation 

(TV) 
Pi ^ Pi- 

To see that a similar conclusion can be drawn for C^ N ~ l \ consider the following diagram: 



— CM - — £(W-2) 



C (JV-1) _ £(N-1) 



Pi 



C (N) - 



As before, let p[ N ~^ represent the output of the unknown sequence of operations within 
the dashed box. Here, the principle of optimality establishes that the combined operation 
C^) o £( Ar_1 ) o C^ -1 ) has to be optimal for the transformation p,-^ ~^ i— > TV Since has 
already been determined [in terms of p\ ], and f^ -1 ) is not a controllable operation, we 
are only left with the task of determining the optimal C^ -1 ). This is done by solving the 
optimization problem 



max Y 7r.Tr \(C^ o o C^) (pf~ l) ) 



C(iV-l) g gsct 



(7.5) 



which, with some simple algebra can be re-expressed in terms of the Choi matrices of 
and as follows: 

W-i) T 



max Tr 

C (JV-1) 



where 



<8> 7Tj£> 



(7V-1) 



subject to C (JV_1) > and 



_(AT-1) 

Qi 



Tr 



2,3 



Pi 



Tr 2 C^ 1 ) = l d , 
(7.6) 

(7.7) 



and Tr 2i 3 denotes the partial trace operation over the second and third subsystems of di- 
mension d. 

Up to here we have been closely following the dynamic programming recipe to optimally 
solve the multi-step tracking problem. To continue along these lines, though, we would now 
have to face the difficulty involved in solving the optimization problem (|7.6|) . Although this 
problem resembles the SDP maximizing the average Hilbert-Schmidt inner product between 
sequences [p\ N ]f =1 and [g[ N ]f =1 [compare with Eq. (|4.43|) 1, this is just a superficial 
similarity. In fact, the matrices ~g\ N ~ 1>> also depend on ^ N ~^ because they explicitly depend 
on <£( N \ which, in turn, depends on pf' = (f^ -1 ) o C^ N ~^)(p\ N ^). 

Our work around is to give up global optimality by relying on a simplifying assumption. 
Assuming that we know how to optimally solve the single-step tracking problem (which is a 
SDP), our goal is to exploit its solution to build a "locally optimal" multi-step scheme. For 
that purpose, we simply ignore the dependence of ~g\ N ^ on C^" 1 ), regarding it as a fixed 
(but arbitrary) state. As a result, we choose (t*^ 1 ) to be the optimal single-step tracking 
operation^ from [p\ N ]j=i to [q\ N ]j =1 - 



1 There is, however, a subtlety: Although the tensor product structure inside the partial trace of Eq. (7.7) 
guarantees that ~q\ N > 0, in general ~g[ N is not normalized. As such, one should make sure that the 
optimal analytic solution to be used for is still optimal if the target density matrices arc unnormalizcd. 

Recall that this was seen to be the case for the optimal single-step tracker constructed in Ch. ^ 
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It might be helpful to consider what has been learnt hitherto. From the first step, we 
have seen how can be constructed as a function of p\ N ^ . From the second step, we have 
seen how C^ -1 ) can be constructed as a function of p\ N and p\ [via <£^]. We can now 
follow with this backwards approach, at each step characterizing each as an optimal 
single-step tracking operation from arbitrary density matrices p[ to targets £^ satisfying 



_(n) 

Qi 



Tr 



2.3 



(71 



for n = 1, . . . , N — 1 and g\ N ^ = p { . Given that (£( n+1 ) is a function of p[ n+1 ^ and g\"'' r±J , the 
above gives a more explicit form of relation ( |7. 1| ) . 

As noted in Eq. (|7.2|) , this recursion relation provides a way to obtain the targets of each 
controller as a function of the arbitrary sources introduced to the right of the controller at 
hand. Since each controller is fully specified by the knowledge of its sources and targets, 
we just need to self-consistently determine the (so far) arbitrary sources pf 1 ' in order to 
completely characterize a sequence of controllers. This is the content of the following section. 



7.2.2 Forward direction 



It is much easier to construct relation ( |7.3| ). In fact, this is simply the forward evolution 
with CPTP maps implementing optimal tracking p^ i— > gf 1 ' at each step, 



(7.9) 



Re-expressed in terms of the Choi matrices of each map, the above reads 



(n+1) _ rp 
Pi — lr l,2 



<£^ T ® l) (p^ ® (gW) 



(7.10) 



which is precisely of the form of Eq. (|7.3j ), since is a function of p^ and g\ n \ 

The set of Eqs. ( [7.1 0|) can now be solved simultaneously with the set of Eqs. ( |7.8|) to give 
each matrix p[ n ^ and ~g^ ■ From the solution of this system, a sequence of controllers 
can then be obtained. 



7.2.3 Multi-step tracking for a pair of qubit states 

Let us now look at the more concrete example of tracking for a pair of qubits in N steps, for 
which we shall employ the analytical solution for the single-step tracking problem obtained 
in Ch. p. In the following subsections we explicitly write the set of equations (|7.8|) and (|7.10D 
in terms of Bloch vectors and discuss some numerical results for N = 2. 
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Equations for backwards direction. Eq. (|7.8|) is evaluated with the use of the following 
parameterizations: 



_(n+l) (n+l)*, . Wn+1) 



(n+1) 



(7.11) 
(7.12) 



2C(" +1 ) =14 + 4 n+1) l 2 ® 

3 3 

2(S<»> = 1 4 + £ t$ n) l 2 ® («#° •*)+£ Af (/if • <x T ) ® • o-) , (7.13) 



where, in Eq. ( |7.11[) , we kept with the convention from Ch. |6] (cf. page |107|) of writing 



R 



(n+l) 



to the "Bloch vector" of TTigf 1 . Furthermore, we introduced the constant c^ 1 ' to 



account for the fact that g[ n is not normalized, as explained in Sec. |6.3.2| [however, we 
have cf^ = 1 to comply with the fact that g\ = pj. Eq. ( |7.12| ) gives the general form 



of the optimal single-step tracking solution, and was derived in Appendix |D.3j . The versors 
v^ +l \ uj 1 and the scalars fJij and s^ 1 are functions of the Bloch vectors R[ n+1 ^ 

and J2- n+1 ^ as described in Appendix D.3 . Finally, Eq. ( |7.13| ) gives the general Choi matrix 



of a CPTP map (cf. Sec. |2.2.3.6| , page ^). Since the noise is assumed to be known, the 
parameters Xj and are considered given. 

Substituting the above formulas in Eq. (|7.8|), after some algebra we find that 



R 



(n) 



(7.14) 



k=l 



where Q ™ is a real vector carrying all the undetermined parameters: 



Qi n+1] ■= E4 n+1) H n+1) • ^ n+1) ) - >+1) 



V; 



(7.15) 



A more explicit form can be obtained by evaluating the dot product with the aid of Eq. ( p.31| ): 
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where, for brevity, we have omitted the index (n + l) from every symbol. 



Equations for forward direction. Evaluation of Eq. (|7.10| ) is accomplished with Eqs. ( [7.13j ) 
and (|7.12|) for (B^ and <£( n \ respectively, and 



(7.17) 
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for the source density matrices, where gives the Bloch vector of the density matrices 
pf 1 . After some manipulation, we find 



k=l 



(7.18) 



where now, P^ is the real vector carrying in the undetermined parameters 



p( n ) ._ s ( n )„,( n ) 
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Computing the dot products with the aid of Eqs. (p.30|) , gives 
I Si + z// r 
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where, once again, we omitted the indices (n) for brevity. 



System of Equations. In summary, combining Eqs. ( |7.14 ) and (|7.18 ) we obtain the 
following non-linear system of 21 (N — 1) vector equations with 21 (N — 1) vector variables 



From Eq. (|7lj ) 
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where and Q- n+i ' 1 are respectively defined in Eqs. (|7.19|) and ( [7.15|) for 



9 k 



n 



(7.21) 
JV-1. 



Due to the non-linearity posed by the complicated dependence of the vectors Pi and 
Qi on the Bloch vectors, there exist many different solutions for the system ( |7.21| ). This is 
clearly noticed when we use Matlab (function f solve) to search for a solution in particular 
cases. In order to run the numerical solver, it is necessary to provide a initial guess of 
what the solution is, and we found that by varying the choice of this starting point the 
algorithm converges to different solutions (or does not converge at all). In general, different 
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solutions for the same problem lead to different values of fidelity, and sometimes we end 
up converging to solutions that give fidelities smaller than the optimal single-step tracking 
fidelity! Nevertheless, by varying the initial conditions, we have always been able to find 
solutions which are at least no worse than optimally correcting only at the end. 



7.2.3.1 Numerical solution for some two-step cases 



In this section we look at the numerical solution of the system ( [7.211) for N = 2 and a control 



task of stabilizing (with uniform priorities 7Ti = tt2 = 0.5) a pair of pure qubit states lying on 



the XZ plane of the Bloch sphere and straddling its equator by an angle ±7r/4 [cf. Eqs. ( |5.2|) 
and ( |5.3[ )]. The noise in between the two corrections is taken to be a diagonal non-unital 
extreme point of the set of CPTP maps. More specifically, it compresses the Bloch sphere by 
Ai, A 2 and A 3 along the x-, y- and z- directions, respectively (1 representing no compression, 
and representing full compression), and translates it by £3 along the z-axis. The extremal 
character is imposed by choosing A3 = A1A2 and £3 = \/ {1 — A^)(l — A|) [cf. Eqs. ( |2.51| )1. 
With this choice, the noise can be characterized with only two parameters. Each one of 
the three plots in Fig. |7l| presents the solution of the system with respect to the choices of 
parameters: Ai x t 3 , A2 x £3 and A3 x t 3 . For the presented results, we provided an initial 
condition corresponding to a sequence of "do-nothings operations" . 

Each point in the plots represents a numerical solution of system ( |7.21| ) for the values 
of noise parameters indicated in the axis. The crosses (in blue) indicate those cases for 
which the possibility of using a correction before the action of the noise was found to be 
advantageous. Although not visible from the plots, the advantage of the 2-step schemes over 
the optimal 1-step schemes was found to vary up to a maximum of approximately 10%, which 
occurred at the point marked in each plot with a circle (Ai = 0.70, A2 = 0.46, A3 = 0.32 and 
£3 = 0.63). It is a general observation (for which we do not have a satisfactory explanation) 
that we only produce 2-step strategies that over-perform the optimal 1-step strategy for 
intermediate to large values of the translation £3. 

The dots (in red) represent those noises for which the numerical algorithm converged to 
operations giving fidelities smaller than those given by the optimal single-step scheme. Of 
course, this simply reflects the mentioned weakness of the method of converging to subop- 
timal strategies. In this case, this could be remedied by repeatedly running the numerics 
with different initial conditions until a fidelity higher than (or equal to) that of the optimal 
single-step scheme was achieved. Finally, the empty regions correspond to the noises where 
the convergence occurred to operations that recover the optimal single-step case, i.e., to 
do nothing at the first step and implement the optimal single-step tracking strategy in the 
second step. 

By looking at the details of the operations found for each noise, we found that we generally 
obtain correction schemes composed by two unitaries or a unitary in the first step and a 
non-unitary in the second stepQ. Although we cannot guarantee that the optimal multi-step 
scheme would not be composed of two non-unitary corrections, the fact that this is never 



2 As a matter of fact, sometimes we obtain correction schemes formed by measurements in the two steps, 
but by varying the initial condition we have always been able to find an alternative strategy with a unitary 
in the first step that over-performed the strategy with two non-unitaries. Once again, this illustrates that 
our method can converge to suboptimal solutions. 
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Figure 7.3: (Color online) Two-steps stabilization for a pair of qubits states under extremal 
noise. The crosses (in blue) indicate the values of the noise parameters for which the solution 
of system ( 7.2l| ) converged to give two quantum operations producing a higher fidelity than in 
the case of a single optimal correction at the end. The dots (in red) illustrate a weakness of the 
method: for the corresponding noises, the resulting corrections turned out to give a smaller fidelity 
than that of a single optimal correction at the end. For the remaining empty regions (within the 
domain of physical noises), the solution of the system converged to give the same fidelity as that 
obtained by correcting only at the end. The circled crosses in each plot indicate the values of the 
noise parameter for which the highest advantage of approximately 10% was obtained with respect 
to the optimal performance of a single correction at the end. 



obtained from our approach suggests that the addition of any extra-noise before the last step 
is generally prejudicial to stabilize the states of a qubit. 

Similar conclusions were drawn by Gregoratti and Werner [ 147 , 148(1 , who considered 
error correction strategies that aimed at stabilizing the entire Bloch sphere and, different 
from our case, allowed the possibility of making measurements on the environment that 
induced the noise. In the case of qubits, they proved that the best strategy was to correct 
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just at the end. 

In our case, this is not quite true. Since we only require stabilization of a pair of states, 
we find that sometimes it is advantageous to rotate these states along the Bloch sphere and 
thus make them less susceptible to the noise to come. This is precisely what happens in the 
case of the blue crosses in Fig. fT3[ However, as in Ref. |[L48|| , it seems to be better to leave 



any measurements to the last step. This hypothesis was also tested and verified for many 
other noises and pairs of qubit states, but further investigation of this matter is necessary 
before a general claim can be made. 



8 

Conclusion 



Motivated by a long list of successes in the classical framework, the use of feedback control 
in quantum systems is a promising direction for the development of new quantum technolo- 
gies. However, distinguishing quantum and classical feedback is the fundamental fact that 
quantum measurements intrinsically disturb the system being measured. As a consequence, 
a naive (or classically inspired) use of feedback in quantum systems may contradict one's 
intentions of gaining control, and instead lead to the addition of copious amounts of noise. 
In this thesis, we investigated optimal ways of measuring finite dimension quantum systems, 
in such a way that the balance between information gain and back-action noise can be made 
favorable for the application of feedback control in the quantum domain. In what follows, 
we summarize our main results and outline some directions for future work. 

The specific control problem we focused on was that of inter-converting between sequences 
of density matrices. This problem subsumes many situations of practical interest where one 
attempts to drive the dynamics of an initially unknown state: while the input (source) 
sequence models the initial uncertainty, the output (target) sequence models the states 
one would like to obtain conditioned on the identity of the initial state. Tasks such as 
optimal quantum state-discrimination, state-dependent quantum cloning and quantum error 
correction can all be formulated in terms of this problem. 

In order to guide the design of optimal feedback schemes and quantify their merits, we 
started in Ch. ^ studying distance measures for the space of density matrices. Our contri- 
bution to this topic was the proposal of an alternative definition of fidelity between mixed 
states. One of the most appealing properties of our "new fidelity" is that it is significantly 
easier to compute than the Uhlmann-Jozsa fidelity. In fact, it only requires the computation 
of traces of some products between density matrices, whereas the traditional fidelity generally 
involves a more expensive matrix diagonalization procedure. In addition, our new fidelity 
satisfies all of Jozsa's axioms, gives rise to a metric and is jointly concave. An important 
byproduct of the joint concavity of our fidelity was the establishment of the joint concavity 
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property of Uhlmann-Jozsa fidelity in the case of qubit states, settling an open problem in 
the field. In this chapter, we have also reviewed some known metrics on the space of density 
matrices, and showed how metrics for the space of sequences of density matrices can be built 
from those. 

Equipped with the above provisions, in Ch. |]we set out assembling our control problem as 
a particular type of convex optimization called semidefinite programs. This is a well-studied 
class of optimization problems for which efficient numerical methods exist to determine 
the optimal solutions. After showing some algebraic tricks that allowed the minimization 
(maximization) of several distance (closeness) measures to be written as SDPs, we exploited 
these methods to obtain optimal controllers. Based on these results, we investigated the 
sensitivity of the controllers with the choice of distance measure. On top of identifying 
some cases where the same controller optimizes several measures, we developed and tested 
a method to estimate the "compatibility" between optimal controllers optimizing different 
distance measures. In this regard, there are opportunities for refinement and future work; 
for example, we should enlarge the numerical samples used to compute the compatibilities 
and check whether the same conclusions still apply. 

Our analytical investigations started in Ch. || where we considered the problem of sta- 
bilizing the state of a single qubit prepared in one of two non-orthogonal states undergoing 
dephasing noise. We proposed two different types of feedback strategies to approach this 
problem: The first was based on the classical concept of exploiting the measurement to 
discriminate between the two possible initial preparations, and then follow with a suitable 
repreparation of the system. The second was based on the idea that non-orthogonal states 
are fundamentally indistinguishable. As opposed to attempting to discriminate the two 
states, we used a quantum measurement to learn about how the noise affected the system, 
and fedback to counter-act the noise. We proved that our classical and quantum strate- 
gies were optimal, in the sense that no other entanglement-breaking-trace-preserving or 
completely-positive-trace-preserving maps, respectively, could produce a higher fidelity for 
the stabilization task. We have also proposed an (arguably optimal) stochastic discriminate- 
and-reprepare scheme, which admitted the possibility of occasional inconclusive results in the 
discrimination step. Quite surprisingly, this scheme produced the same optimal performance 
as the deterministic quantum scheme. This observation motivates some future research to 
find whether this is just a peculiarity of the problem considered here (two qubit states, 
dephasing noise), or else if in more general circumstances we can still achieve the optimal 
performance of quantum schemes by using stochastic discriminate-and-reprepare strategies. 

In Ch. |6| the analytical results from the previous chapter were significantly extended. We 
introduced an optimal strategy for optimally transforming the state of a single qubit into a 
given target state, when the system can be prepared in two different ways, and the target 
state depends on the choice of preparation. This generalizes the results of Ch. |5] in two 
points: Because we now allow the source qubit states to be arbitrarily chosen, they can be 
regarded as the outputs of an arbitrary noise channel. Moreover, due to the arbitrariness of 
the target states, we were able not only to optimally stabilize the states, but also to optimally 
track them while competing against the noise dynamics. For this quantum tracking task, 
we found that feedback control is not always useful, and sometimes it is actually better 
not to measure the system at all. In these cases — which were flagged by an analytic 
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indicator function - - the application of a unitary map after the noise was found to be 
the best strategy. Otherwise, a closed loop scheme giving optimal measurement strengths 
and feedback was analytically constructed. Several applications for our quantum tracking 
strategy were discussed in the framework of quantum information. Attesting the quality 
of the scheme, some optimal strategies for quantum state discrimination, purification, error 
correction and state-dependent cloning were recovered and extended. 

Chapter [F| concluded our scientific results with the proposal (and some preliminary nu- 
merical solutions) of a variation of the tracking problem studied in Ch. |6|. Here, we allowed 
controlled interventions not only after the system was exposed to the noise, but also before 
and during the action of the noise. In general, it is not clear whether the use of feedback 
at earlier stages of the dynamics is beneficial for the tracking goal, since the information 
gain could not compensate the back-action noise imparted on the system by multiple uses 
of quantum measurements. To address this problem, we developed a method (based on dy- 
namic programming and on a heuristic argument) to derive suboptimal multi-step schemes. 
By applying our method to some examples involving 2-step stabilization of a pair of qubit 
states, we noticed that we can usually over-perform the optimal single-step scheme. In ad- 
dition, we found that our 2-step schemes were always composed by some unitary map in 
the first step and (sometimes) a weak measurement in the second step, suggesting that the 
observation of quantum systems along the way may not be advisable. The confirmation of 
this hypothesis, though, is still an open problem for future research. 

Throughout this thesis, we studied optimal strategies for controlling quantum system. 
Optimality, however, is only possible when the details of the control task are known within 
a certain (high) level of accuracy, which for practical applications may not be realistic. For 
example, the noise model affecting the system has always been assumed to be perfectly 
known, giving rise to a sequence of source states onto which we relied for the construction 
of our control strategies. In practice, quantum process tomography is subjected to errors, 
and a quantum channel very accurately characterized at a certain point in time may change 
its properties when the system is actually running. Therefore, an important extension of 
this work would come from the introduction of uncertainties for the source sequence, as a 
result of uncertainties in the noise model that gives rise to it. The determination of control 
strategies in the presence of uncertainties is part of what is called robust control. 

In the context of our quantum tracking scenario, we envisage the following problem: 
Suppose that, as a result of a precarious characterization of the noise model, our knowledge 
of the source sequence is imperfect, which is modeled by a nominal sequence and many 
equally possible source sequences, as illustrated in Fig. |87T| . We would like to determine a 
quantum controller that guarantees that the minimal performance in approximating a given 
target sequence is above some pre-established threshold. In particular, we would like to 
know how high this threshold can be made while still physically achievable. The formulation 
of some instances of this problem as SDPs has already been accomplished, and it is our 
intention to further explore this topic both in the numerical and analytical frameworks. 
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Figure 8.1: Schematic of the robust tracking problem. 
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Appendices to Chapter 2 



A.l The Permutation matrix P 



d4 



In this appendix we give an explicit construct of the permutation matrix P d 4 that establishes 
the equality 

vec (A ® B) = P d 4 (vecA <g> vecP) , (A.l) 

for A and B arbitrary d x d matrices. In this thesis, the need for this matrix arises in 
Sec. |2.2.3.3| , where we construct the Choi matrix of a CP map formed from the composition 
of two CP maps. 

In order to determine P d 4, we first numerically solved Eq. ( |A.1| ) with fixed matrices A 
and B of dimension d = 2, 3 and 4 (notice that P d 4 should not depend on the particular 
choice of A and B). The resulting permutation matrices are shown in Fig. |A.l . 
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(b) d = 3 



(c)d 



Figure A.l: The permutation matrix P d 4 of Eq. ( |A.1| ) for d = 2, 3 and 4. The black marks 
correspond to l's, the remaining spaces are rilled with O's. 



The form of for arbitrary values of d can be inferred from the instances shown in 



Fig. |Alj For example, P d 4 is easily recognized as a block diagonal matrix with d identi- 
cal blocks of dimension d 3 . In addition, after a careful analysis of the generated pattern, 
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Eq. ( A. 2 ) was identified as the correct formula specifying the position 
elements in the first block of P^, 



of the unit 



j = i + 



d(d-i; 



d 2 



d(d 2 



(A.2) 



where i = 1, . . . , d 3 and |_"J denotes the floor function. 

Finally, let us mention that P&a is n °t the same as the so-called "vec-permutation matrix" 
which is defined as the (unique) d 2 x d 2 permutation matrix P such that vecA T = PvecA. 
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Except for Appendix [B.4| , all the appendices presented here appear in Ref . |T| . Appendix |B.5| 



has been modified to include Matlab codes for the computation of the metrics % and 0, 
which were not considered in Ref. 1X1. 



B.l Proof of Proposition [2] 

In this appendix the joint concavity of 5at is established via the proof of Proposition 0. 
Proof. Differentiating Eq. ( |3.25| ) twice with respect to x, we obtain 

d 2 F(x) d 2 f(x) , . „. .d 2 q(x) df(x)dq(x) . , 



dx 2 dx 2 dx 2 dx dx 



where, for convenience, we define the functions f(x) '■— \/l — \\f + xu\\ 2 and g\ 
\J\ — \\s + xv\\ 2 . 



After some computation we find that 

d 2 F (x) 
dx 2 

where 



5lW . 2 _-^!_/^!, ,b,) 



w ■ (r + xu) v • (s + x«) j(x) [w-(r + iu)] 2 / (x) [v ■ (s + xv)f 

X$2\%) '■= * 7-r~s — T~\ 5 5 • 1^5.4) 

/(*)</(*) [f(x)f [g(x)] 3 
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The negative semidefiniteness of d 2 F(x)/dx 2 in the range x G [0, 1] can be observed if $i(x) 
and §2(20 are written in the following alternative form: 



f(x) 



u 



7(x) 



V 



f (x) g (x) 



g{x) 
f(x) 



u • ir + xu 



g{x) 



v • (s + xv) 



(B.5) 

(B.6) 
□ 



B.2 Proof of Super- multiplicativity of 3^ 

To prove that 3"^ is super-multiplicative, we first define := Trp^ and := Trof, such 
that < Tj, Sj < 1 (note that here we use r$ instead of rf as the norm square of r i; likewise 
for si). Straightforward algebra gives 

y/(l - nr 2 )(l - Sl s 2 ) - ^(1 - ri)(l - Sl )(l - r 2 )(l - s 2 ) 
- Tr [px^] y/(l-r 2 )(l-s 2 ) - Tr [p 2 a 2 ] ^/(l - ri )(l - Sl ) . 

A direct application of Cauchy-Schwarz's inequality Tr [piCTi] < ^/r\Si gives 

$n(P1 <8> P2, CTl <S> CT 2 ) - Jjv^l, CTl)3 r Ar(p2, Cr 2 ) > 

v /(l-r 1 r 2 )(l-s 1 s 2 )- v /(l- ri )(l-s 1 )(l-r 2 )(l- S2 ) 
-Vnsi(l - r 2 )(l - s 2 ) - A/r 2 s 2 (l - n)(l - si) . 

The super-multiplicative property is obtained by showing the positive semi-definiteness 
of the rhs of the above expression. This is the content of the following proposition: 

Proposition 6. For < a, b, c, d < 1, we have 

y/(l -ab)(l -cd) > y/(l - a)(l -b)(l- c)(l - d) + y/ac(l - 6)(1 - d) + y/bd(l - a)(l - c) . 

(B.7) 

Proof. First note that if any of the variables equals 1, then the validity of the inequality is 
immediate. For example, let d = 1 so that ( |B.7| ) reduces to 

y/(l - ab)(l - c) > y/b{\ - a)(l - c) . (B.8) 

This is trivially satisfied for all < a, b, c < 1. In what follows, we restrict to < a, b, c, d < 1 
and show that inequality (|B.7|) is equivalent to the standard inequality of arithmetic and 
geometric means (hereafter referred as the AM-GM inequality). This inequality is just an 
expression of the fact that the geometric mean of a list of non-negative real numbers is never 
larger than the corresponding arithmetic mean. 



B.3 Proof of the Metric Property of B[3] and C[3] 
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Apply the substitution a' = 1 — a (similarly for b', d and d'; note that < a 1 , b', d, d! < 1) 
to the inequality ( |B.7| ) and divide the result by \J a'b'dd' to get the equivalent inequality 

+ A + B)(l + C + D) > 1 + VAC + VbD, (B.9) 

where we have defined A = — 1 (similarly for B, C and D; note that < A, B, C, D < oo). 
Squaring the inequality above we find 



A + C + B ± D + AD+BC ^^^^ 



2 2 2 

which is clearly a sum of three AM-GM inequalities. 



(B.10) 
□ 



B.3 Proof of the Metric Property of B[3\ and C[3] 

In the following, we give a new demonstration of the metric properties of -B[3 r ] and C[3] 



see Refs. ||103| , |105|| for the standard proofs). Our proof consists of a simple application of 



Theorem |3.1| due to Schoenberg. 



Proposition 7. The quantities B[3 r (p,o~)] and C[3 r (p,a)] given in Eqs. ( 3. IS ) and ( 3.14 ), 
respectively, are metrics for the space of density matrices. 

Proof. For brevity, let K[3 r (p,cr)} represent either B[3 r (p,cr)] or C[3 r (p,a)]. As with 3", it is 
easy to check that K 2 \ft(p, a)] is symmetric in its two arguments, and that i^ 2 [9 r (p, a)] > 
with saturation iff p = a . So, according to Theorem |3.1| , K[3(p, a)] is a metric if for any 
set of density matrices {pj}™ =1 (n > 2) and real numbers {cj}™ =1 such that YH=i c « = 0' ^ * s 
true that 

n 

K 2 [?(Pi, Pj)}dCj <o. (B.ll) 

To prove this, we derive an upper bound for K 2 ^{pi, pj)] that can be easily seen to satisfy 
the condition above. First, note that 

^(pupj) = (Tr Iv^v^il) 2 ^ \ Tl VPiVPj\ 2 = 

(Tr v^v^7) 2 = Tr [^pl^fp] <8> ^PiVPl] = 

^[{VPi®Vft(VPj®^)]=A{pi, Pj ), (B.12) 



where the first equality follows from the definition \A\ := V A^A for every matrix A and the 
inequality from the fact that Tr \A\ = max^ | Tr C/v4 1 (the maximization runs over unitary 
matrices U Jf?, 129| ). Then, it follows that 



B 2 ^} = 2 (l - Vfj < 2 (1 - 3) < 2 (1 - A) , 
C 2 [3] = 1 - 3 < 1 - A < 2 (1 - A) , 



(B.13) 
(B.14) 



or, in our more compact notation, il" 2 [9 r ] < 2(1 — A). 
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Now, replacing K 2 [9 r (pj, pj)} with the above upper bound in the lhs of Eq. QB.ll ), it is 
easy to obtain the desired inequality: 



{2-2Tr [(v/p~®v/p7) (VP] ® VP])]} = "2Tr 



i=l 



<0, (B.15) 



where the equality is obtained by using that Ym=i Ci = ^' ^ ne linearity of the trace operation 
and the hermiticity of Ci^fpi ® J~pl. 

□ 

Finally, let us just mention that besides establishing the metric properties of Bft] and 



C[3], the present proof also establishes y 2 — 2 (Tr y/py/a) as a metric for the space of 
density matrices. In fact, by a similar application of Schoenberg's theorem, the quantity 



H(p, a) := J 2 — 2 Tr yff>\fo can also be shown to be a metric. 



B.4 Proof of the Metric Property of !K" 



In Sec. |3.3.1.2| we defined the Hilbert-Schmidt distance 'K between two density matrices p 
and a as the Hilbert-Schmdit norm of the matrix p — a. As shown on page |50|, such a 
definition guarantees that !K is a metric for the space of density matrices. In this appendix, 
we demonstrate that the function 



tf(p,v)=Tr[(p -of] 
also defines a metric for the space of density matrices. 



^B.16) 



First note that properties (M1)-(M3) on page ^T] are easily verified for "K 2 , as a conse- 
quence of holding true for "K. We only have to check the triangle inequality (M4), which is 
necessarily satisfied if 



= ^ ^ 4 (pi,Pj)ciCj < 



;b.i7) 



1=1 



for every n > 2 and q G M. The above implication arises from Schoenberg's theorem (cf. 
Theorem I3TT 



on page 



In order to prove Eq. ( |B.17|) , consider the inequalities 



2L 

7! 



< 



2L 
7! 



< 2 (1 - 3) < 2 (1 - A) 



:b.i8) 



where the first inequality follows from the fact that J{ is bounded between and \/2, the 
second is equivalent to Eq. (|3.69|) (second inequality in the first line) and the third was 
established in Eq. ( [B. 12| ) , where the function A(p, a) was defined. Inequalities (p.l8| ) imply 
that Eq. ( |B.17|) is satisfied if the implication holds when 2(1 — ^1) replaces "K 4 . That this is 
the case, has already been seen in Eq. ( |B.15| ). 
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B.5 Matlab Codes 

In this Appendix, we present the Matlab codes that we have used to compute £Fjv, 2) and 



Q in the numerical experiment presented in Sec. |3.2.3| . For completeness, we also present 
similar codes for the computation of the metrics "K and 0. 
For rho and sigma density matrices, 

• 3^ was computed using 

Fn = real ( rho ( : ) ' * s igma ( : ) + ... 

sqrt((l - rho ( : ) ' * rho ( : ) ) * ... 
(1 - sigma (:)'* sigma (:)) ) ); 

• £F was computed using 
[V, D] = eig(rho); 

sqrtRho = V*diag ( sqrt (diag (D) ) ) *V ' ; 

F = sum( sqrt ( eig ( Hermit ize ( sqrtRho * s igma* sqrtRho )) ) )~2; 

Here sqrtRho*sigma*sqrtRho is not quite Hermitian due to small numerical errors. 
We therefore employ the function Hermit ize (M) = (M+M ' ) /2 to turn the almost-Hermitian 
matrix into a Hermitian one — this causes Matlab to select a more efficient algorithm 
for the diagonalization. 

• D was computed using 

D=0 . 5* sum ( abs ( eig (rho - sigma) )); 

• Q was computed using 

[Vr ,Drho]=eig(rho) ; Dr=diag(Drho) ; 

[Vs ,Dsigma]=eig(sigma) ; Ds=diag(Dsigma) ; 

A = abs(Vr '*Vs) . ~2; 

[x,Q]=fminbnd(@(s) (Dr. ' .~s)*A*(Ds.~(l-s)) , 0, 1); 

The algorithm used here follows from the formula for Tr(p s cr 1_s ) given in the section 
entitled convexity in s of Ref. [Si]. 

• 3~C can be computed using 

H = sqrt ( ( rho (:)- s igma (:))'*( rho (:)- s igma (:) ) ); 



• can be computed using 

0=max(abs( eig (rho - sigma) )); 
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In Sec. |5.3.1| and |5.4.6| , the proposed deterministic classical and quantum control schemes 
were shown to be optimal among the set of EBTP and CPTP maps, respectively. Here, we 
provide constructive proofs of these results in further detail. All the appendices presented 
here appear in Ref. 0. 



C.l Dual optimization for deterministic quantum con- 
trol 

As demonstrated in Sec. |5.4.6| , obtaining the maximum average fidelity can be expressed 
as the optimization problem Q5.41 ). For this problem (as for the classical problem which 



we address in the next section) the dual optimization proves to be straightforward to solve 
analytically and the results above can then be used to show optimality of the control scheme 
given by Eq. ( |5.33| ). 

We make use of some symmetry arguments to simplify the problem. This optimization 
problem has certain symmetry properties under the action of the group of transformations 
generated by the rotation <£ — > (A(g>A)(C(Acg>A)"'" and the transpose £ — > £ T . Specifically, the 
objective function is invariant under the action of this group since Tr[i?(A(g> A)£(A® A)] = 
Tr (R£) and Tr (R£ T ) = Tr (R£), because (A® X)R(X® X) = R and R T = R, respectively. 
In addition, the constraints are covariant under the action of the group: Since conjugation 
with a unitary and transposition preserve eig envalues, (A <g> A)£(A ® A) > and £ T > if 
C > 0. To see that the equality constraints are covariant note that Tr 2 £ = I2 is equivalent 
to the condition Tr [(M (g) I.2)<£] = Tr M for all hermitian M. If £ obeys the partial trace 
constraint we have 

Tr[(M <g> I2XA (g) A)£(A <g> A)] = Tr[(AMA <g> 1 2 )£] = Tr M , (C.l) 
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and 

Tr[(M ® 1 2 )£ T ] = Tr[(M T ® 1 2 )C] = Tr M , (C.2) 

so both (X®X)£(X®X) and £ T do also. So both the objective function and the feasible set 
of ( |5.41|) are invariant under the action of the group. As a result there will be an invariant 
point C* nv = (X ® X)€* nY (X ® X) = €*J V that achieves the optimum p* @. We do not need 
to optimize over the full set of € but may restrict our attention to the set of invariant £i nv . 
Gatermann and Parrilo |137| have investigated such invariant SDP's in detail. 
The dual of our optimization problem ( |5.41|) has the form [19| 



minimize Tr M 
subject to M ® t 2 - R > 



(C.3) 



Notice that (as is generally the case) this semidefmite program is invariant under the same 
group of transformations as the original problem, under which M — > XMX and M — > M T . 
For the dual problem we may likewise restrict attention to M- my = b i. 2 + b x X that are 
invariant under the action of the group. This gives a simpler dual optimization 



minimize 2bo 
subject to + b x X 



R > 



(C.4) 



where 60 and b x are the new variables. This problem is simple enough to solve analytically; 
the solution is 



1 1 / sin 4 # 

- + -Wcos 2 ^ + 

4 4 V l-rl 



(C.5) 



and b x = r x b [with r x = (1 — 2p) cos 6}. This may be checked by verifying that the matrix 
&0I4 + b x X I2 — R is indeed positive semi-definite, hence 2bo is a valid dual feasible value. 



Because 26 reproduces the fidelity of our proposed scheme, given by Eq. ( |5.36|) , this guess 
necessarily gives an optimal solution to the original problem 



C.2 Dual optimization for deterministic classical con- 
trol 

The same approach is used to solve the problem ( |5.13|) . We start by mapping the set of trace- 
preserving entanglement breaking qubit channels to bipartite states 23. For these channels 23 
is positive, has partial trace equal to the identity, and is also separable ||45| . Because 23 is an 
(unnormalised) state of two qubits, the separability condition is equivalent to the positivity 
of the partial transpose |3f| . We will denote the partial transpose of the operator 23 on the 



second subsystem H by 23 T ' 2 . Thus we may rephrase the optimization problem ( |5.13| ) in the 
form 

maximize Tr (-R23) 

subject to 23 > , 23" 1 " 2 > (C.6) 
Tr 2 23 = 1 2 . 

Note that the condition of positivity of the partial transpose guarantees that 23 corresponds 
to an entanglement breaking map. 



C.2 Dual optimization for deterministic classical control 
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The new problem has the same symmetries as the full optimization ( |5.41 ) with one 



addition. Notice that R Ti = R so the objective function of both problems is invariant 
under partial transpose. In our new problem the point Q3 T2 is feasible if 03 is feasible, 
so the feasible set is also invariant under the partial transpose. [Note that since partial 
transpose does not preserve positivity this is not true of the problem ( |5.41|) 1 . Because of this 
symmetry we may restrict our attention to Q3i nv for which 53^ = 23mv Since the partial 
transpose sends A ® Y — > —A cg> Y where A is any Hermitian matrix, we can conclude that 
Tr[(y4 (g> y)23i nv ] = 0. It is sufficient to check this condition for the full set of Pauli matrices 
1 2 , X, Y, Z so the requirement of invariance under the partial transpose constitutes four new 
constraints. Notice however that the condition 23^ > is now redundant since we are 
requiring that Q3j^ = 23; nv . So we can replace the problem ( |C.6| ) with 

maximize Tr [-RQ3] 

subject to 23 > 

Tr 2 Q3 = l 2 ^- 7j 
Tr[(A <g> Y)»] = \/A e {J, A, Y, Z}. 

Positivity of the partial transpose and hence the separability of 03 is now guaranteed by the 
positivity of 03 and the additional equality constraints. 
The dual of the problem ( |(J.7| ) is 

minimize Tr M (n o\ 

subject to M ®1 2 + N ®F - R>0 ^ ' 

This semidefinite program still has symmetries corresponding to the rotation X <g> X and 
the transpose (but not under the partial transpose.) These two symmetries lead to the 
transformations N — > — XNX and N — > —N T respectively. The only invariant choices of 
N are proportional to Y. As before we may restrict attention to M; nv = aoB-2 + CLxX that 
are invariant under the action of the group and N- my = a y Y. This gives a simpler dual 
optimization 

minimize 2ao ,„ > 

subject to a l 4 + a x X <g> 1 2 + a y Y ®Y - R>0 , ^ ' ' 

where ao, and a y are the new variables. This problem should be compared to the analogous 
dual optimization in the quantum case flC.4| ). Again, this problem can be solved analytically, 
yielding the solution 



a = - + -Vcos 2 # + sin 4 #, (CIO) 

r x r x cos 2 9 , _ _ . 

a x = JL + -£ C.ll 

4 4 Vcos 2 + sin 4 

r x cos^sin 2 ^ 

a v = — — , , . V^A2) 
4 Vcos 2 9 + sin 4 9 

Again, one can check that aoJ.4 + a x X <g> 1 2 + a y y <E> Y — R is positive semidefinite with 

these choices, which ensures that the objective function 2ao is indeed a dual feasible value. 
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The proof of optimality follows as before in the quantum case by: (i) observing that 2a 
reproduces the fidelity (J" 013112 ) of Eq. ( |5.12| ) and (ii) applying the weak duality argument. 

We note that the optimization techniques presented here may be useful when applied 
to more general problems presented in Fuchs and Sasaki [}£7|] . However, when the map in 
question does not act on qubits, there are significant complications in characterizing the 
EBTP maps because the PPT condition is no longer sufficient. 
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Except for Appendix D.3, all the appendices presented here appear in Ref. 



D.l Perfect Tracking Conditions 

A theorem closely related to the aims of this paper has been proved by Alberti and Uhlmann 



171|| , consisting of a mathematical criterion for the existence of physical operations perfectly 



transforming between pairs of qubit states. In this appendix we briefly review this theorem 
and prove an important corollary that is used in a number of places in this paper (e.g., 
sections |6.2.2.1| and |6.4.4| ) . 



Theorem D.l (Alberti and Uhlmann). Let p\, p 2; p 1; p 2 be 2 x 2 density matrices. Then 
there exists a CPTP map A such that 



Pi = Api 



and 



P2 = Mpi 



[D.l) 



if and only if 



\Pl ~ t P2\\tr < \\Pl ~ tP 



2 tr 



for all t G 



;d.2) 



where \\ ■ || tr denotes the trace norm. For higher dimensional density matrices, the above 
condition is necessary but not sufficient for the existence of A. 



As pointed out by Chefles, Jozsa and Winter ||180|| , the condition ( |D.2|) is equivalent to 
the requirement that the target states are no more distinguishable than the source states by 
minimum error probability discrimination (Helstrom |82]]), for any prior probabilities. In the 
particular case where p x and p 2 are pure states, this just means that the Bloch angle between 
~p x and p 2 is smaller than the angle between pi and This is proved in the following. 
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Corollary D.l.l. Let~p l and~p 2 be any two pure distinct qubit states separated by an angle 
G (0, 7r] in the Block representation. Let p\ and p 2 be any (mixed or pure) qubit states 
separated by G (0, it]. A CPTP map A such that 

Pi = A(pi) and p 2 = A(p 2 ) (D.3) 

exists if and only if pi and pi are also pure and < 0. 

Proof. First note that the inequality (p.2| ) can be equivalently written with both sides 
squared. Also, since p l — t~p 2 an d Pi — tp 2 are hermitian matrices, their trace norm can 
be computed as the sum of their eigenvalues. In terms of the Bloch parameters, a straight- 
forward computation gives 

lift - tp 2 \\ 2 = 4(1 + t 2 - 2tcos0) , (D.4) 

where we have made use of the fact that t G M + , and 

|| Pl - tp 2 \\ 2 = 2 [(1 - t) 2 + (R\ + t 2 R\ - 2tR 1 R 2 cos 0)] 

+ 2 | (1 - t) 2 - (R\ + t 2 R\ - 2tR t R 2 cos 0) | , (D.5) 

where Ri gives the magnitude of the Bloch vector for pi, i = 1,2. 

Now assume that the absolute value on the right hand side of Eq. ( p.5| ) can be removed, 
then the inequality ( |D.2j ) takes the form 

l + t 2 -2tcos© < {l-tf, (D.6) 

which for all t G M + is satisfied if and only if cos© = 1. However, as the (pure) target states 
are required to be distinct, we must have cos© < 1. As a result, the inequality ( p.6| ) is 
never satisfied. 

Assume then the complementary case (when the absolute value of Eq. ( p.5| ) is removed 
at the cost of a change of sign). Then ( p.2|) can be written as Fit) < with 

F(t) = (1 - R 2 2 )t 2 - 2t(cos - R X R 2 cos 0) + (1 - R\) . (D.7) 

If i?2 7^ 1, F(t) is a strictly convex function of t, therefore cannot be bounded from above 
by for all t G M + , so it is necessary that R 2 = 1 (p 2 must be pure). Then, define 
G(t) = F(t)\ R2=l , explicitly 

G(t) = -2t(cos - i?i cos 0) + (1 - Rj) , (D.8) 

and require G(t) < 0. 

If R\ 7^ 1, G(t) is a linear function of t with strictly positive linear coefficient. Again, 
such a function cannot be bounded from above by for all t G M + , so it is necessary to make 
Ri = 1 (pi must be pure). Finally, define H{t) = G(t)\ R =1 , i.e., 

H{t) = -2t(cos - cos 0) , (D.9) 

and require H(t) < 0. Clearly, this inequality is satisfied for all t G IR + if and only if 
cos© > cos©, or equivalently, < 0. □ 
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Figure D.l: Schematic for proof that i?_ > R 



D.2 Technical details 
D.2.1 Properties of S and T 

Here, we prove that S > and S + T > if and only if one of the following holds 

i) {Ri,R 2 } is linearly independent; or 

ii) {.Ri, -R2} is linearly dependent with T > 0. 
Moreover, we show that the complementary case 

iii) {Ri, R 2 } is linearly dependent with T < 0, 
occurs only if Q = 0. 

This result is useful to demonstrate that the coefficients /ii, ^2, ^3 and si defined in Eq. 
Q6.ll]) for Q > (procedure A) are always (a) well-defined, (b) real (c) within the range 
[0, 1]. We start with the following lemma 

Lemma D.l. Let R\, R2 be real three dimensional vectors such that R4 < 1 (i — 1, 2). 
Define R^ := R± — R 2 . If R- ^ (i.e., R\, R 2 are distinct), then R 2 _ > R\. 



Proof. Consider the triangle defined by the vectors Ri, R 2 and i?„ as shown in Fig. |D.l . 
The magnitude of R x gives 2 times the area of this triangle so that 

R 2 X = h 2 R 2 _ , (D.10) 

where h is the altitude relative to the side of length We write the following 

R 2 x = h 2 Rl < min (Rx, R 2 ) 2 Rl < R 2 _ . (D.ll) 

The first inequality is a direct consequence of the Pythagorean theorem, and the second 
follows from R4 < 1. This establishes that R 2 _ > Ri. This inequality is trivially saturated 
if i?_ = 0. To see that this is the only case where saturation occurs, assume R- 7^ and 
require saturation of both inequalities in Eq. ( P . 1 1 ) . The first inequality is saturated iff 



R 2 _ = \R\ — R\\ (by the Pythagorean theorem), and the second one iff R\ — R 2 — 1. Taken 
together, these conditions imply R_ = 0, which contradicts the hypothesis. Therefore, if 
^ (i.e., i?i ^ R 2 ), then R 2 _ > R 2 X . □ 



156 



Appendices to Chapter 6 



Now, recall that 



S= a/ T 2 + 4R? X {R 2 _ -R 2 X ). (D.12) 



Assume first linear independence of {Ri,R 2 } (i.e., R x 7^ 0). From Lemma [L>.1| , it is imme- 
diate that S > 0. Moreover, 



S + T = T + \JT 2 + AR X (R 2 _- Rl) >T+\T\ >0, (D.13) 

where the first inequality follows from Lemma p.lj and the second is trivial. Therefore, S > 
and S + T > if condition (i) holds. 

For linearly dependent {-Ri, R2}, it is easy to see that S = \T\ and S + T = \T\ + T, 
therefore S > and S + T > if condition (ii) holds. 

To prove the only if part, consider the complementary case (iii). It is immediate that 
S + T = if {R\, R2} are linearly dependent and T < 0, hence (i) and (ii) are the only 
situations where the premise holds. 

It follows trivially from the discussion above that Q = for condition (iii). Simply note 
that S + T = and the linear dependence of the targets Bloch vectors requires 2,R X R X — 0. 

D.2.2 Well-definedness of the dual feasible point 



The proposed values for the coefficients pi and P3 defined in Eqs. fl6.36c| ), ( |6.37b| ) and ( |6.37c 



have the quantities T a and r& appearing in the denominator. In this appendix we show that 
this does not lead to any singularity as long as the indicated range of Q is considered. 

To see that, note that T a = if and only if R + = R x = 0. This, in turn, is equivalent to 
the statement that the targets have Bloch vectors of same magnitude R pointing to opposite 
directions, which used in Eq. (|6.6| ) gives T = —R 2 _R . In these circumstances, f2 can be 
easily computed to be Q = T + \T\ = 0. Therefore, no singularity can occur in Eq. ( |6.36c| ) 



in the range f2 > 0. 

Similarly, T b = if and only if R + = T — 2R X R X , in which case we can write £7 = S + R + . 

In the sequence we show that S + R + > 0, thus no singularity can occur in Eqs. (|6.37b|) and 
( |6.37cj ) in the range H < 0. 

_____ 2 

From the definition of S in Eq. (|6.7f) , it is immediate that the inequality S + R + > 

2 2 

holds, so we just need to show that S + R, 7^ 0. Suppose, on the contrary, that S = —R + , 



which is possible only if S = R + = 0. From Eq. ( |6.7|) , this can be seen to be equivalent 
to T = R x = R + = 0. To see that this leads to a contradiction, use once again the fact 
that R x = R + = implies opposing target Bloch vectors of same magnitude R, which gives 

2 

T = —R 2 __R 7^ 0. The inequality follows from the conditions of the problem: the source 
states cannot be identical (_R_ 7^ 0), and the case where the two targets are identical to the 
maximally mixed states has been excluded from the analysis (R 7^ 0). 

D.2.3 Characteristic Polynomials for F 

In this appendix we compute the characteristic polynomials of the matrix F, Eq. ( |6.33|) , 
with the set of coefficients given in Eq. ( |6.36| ) (for Q > 0, procedure A) and Eq. ( p.37|) (for 
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Q < 0, procedure B). By studying these polynomials, we show that F > 0, thus completing 
the proof of the optimality of our tracking strategy. 



D. 2.3.1 Procedure A 



For the set of coefficients ( |6.36|) (case > 0), the characteristic equation for F factorizes as 
A 2 P 2 (A) = 0, where 



P 2 (A) = A 2 - T a A + v [(R 2 _ - R 2 x )Tt - ~ 2 ] 



(D.14) 



and 



V 



arir x + (s + ry 



(D.15) 



8R 2 _T 2 a S(S + T) ' 

Since both T a and v are positive, the eigenvalues of F are non-negative if the term in square 
brackets in the Eq. ( |D.14j ) is non- negative when Q > 0. We now show that this term is 
non-negative irrespective of the sign of Q. 

First use Eq. ( p. 40 ) to substitute for E 2 , after some manipulation we find that 



(R 2 



2 -r 2 



a\R% + f^\+R 2 Rl+R\T 



where we have defined 



a :- 



ml 



(i? 2 -R 2 X ) =S-T 



(D.16) 



(D.17) 



S + T 

with the second equality following from Eq. (|6.7|) . Note that the non-negativity of (i? 2 — 
R\ — S 2 cannot be immediately concluded from Eq. ( |D.16| ) — although the first and 

— 2 

second summands are non-negative, the term R + T does admit negative values. However, 
using a = S — T in Eq. ( p,16| ), after some rearrangement we get, 



(R 2 



2 -r 2 



SR 2 + + R 2 _ Rl fl + l-pl 



> 



(D.18) 



from which the fulfillment of the inequality is obvious. In conclusion, procedure A is optimal. 



D. 2.3.2 Procedure B 



For the set of coefficients (|6.37|) (case Q < 0), the characteristic equation for F is \P^{\) = 0, 
where 

P 3 (A) = A 3 - T 6 A 2 + w\ + lu, (D.19) 

and 



to 



^)RtTi-{R 2 _ +R 2 x )(e + ^ 

2\ 



(R X T 2 -Q [R 2 _Tti-R x (e + Z 
8RiT 3 b 



(D.20) 
(D.21) 
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from which it follows that the eigenvalues of F are non negative if w > and lu < when 
Q < 0. Next, we simplify Eqs. ( P-20| ) and ( p.21| ) in order to make it clear that these 
conditions are satisfied. 

It is just a matter of applying Eqs. ( |6.39| ) and ( |6.40| ) to Eq. ( p.20| ) to show that 



w = \ + S + R X R X ) > 



(D.22) 



from which the inequality is clearly seen to hold if Q < 0. 

To prove that ui < if Q < 0, consider first the term in the square brackets in Eq. 
( |JJ . 2 1| ) . Again, employing Eqs. ( |6.39| ) and ( |6.40| ) this can be simplified to i?i-R x r| which 
is obviously non-negative. Therefore, the validity of the inequality uj < if Q < is now 
solely conditioned on the validity of the inequality 



i > for < . 



(D.23) 



To see that this is so, first note that the only way to satisfy the conditions R x = and Q < 
is to have S = T = Q = 0, which implies that ( p.23|) is satisfied with saturation. Consider 
then the complementary case R x ^ and Q < 0. Using Eq. ( |6.39| ) for ^, and multiplying 
and dividing by 4i? x , we get 



AR y 



-4R x R x (tt- S) -4R X R 2 _ 



(D.24) 



Now, from Eq. 
(|D.24j ) gives, after some algebra 



we know that AR 2 X R 2 _ = S 2 - T 2 + 4R 2 X R 2 X , which used in Eq. 



n 



AR, 



(S -T + 2R X R X ) > 0. 



(D.25) 



Once again, the inequality is obviously true if Q < 0, thus establishing the optimality of 
procedure B. 



D.3 Choi matrix for optimal single-step tracking 



In Sec 2.2.3.6, we have seen that any quantum channel C on a single qubit can be represented 



with a Choi matrix £. of the form 

3 3 
k=l k=l 

where {ffc}A,-=i.2,3 arid {uk}k=i,2.3 £ are orthonormal sets of vectors and /!& and Sk are 
certain scalars within the range [—1,1]. 

In this appendix, we derive an explicit formula for the vectors and that specify the 
Choi matrix of the optimal single-step tracking map introduced in Sec. |6.2| . Optimal values 
for the scalar parameters fik and have already been given in Sec. |6.2|, where we saw that 
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S2 — S3 = 0, while S\ and \xy. are conditioned on the value of the indicator function Q [cf. 
Eq. (|6.8D]: Si = and /i^ — 1 if Q < 0, or otherwise they are given by Eqs. fl6.11| ). 
Following Eqs. ( |6.9|) and (|6.12| ), we can write 



2 2 



= h. + j 



2/2 



2f2_ 



(D.27) 
(D.28) 



for z = 1,2 and any source and target Bloch vectors Ri (with Ri ^ R 2 ) and R^ In 
the above, kn and k i2 were given in Eq. ( |6.13| ), while the symbols a and $ are shorthand 
notation for R R 

a = s\ + ^175— and = /x 3 — — = — . (D.29) 



a and /?, were more explicitly evaluated in Eqs. ( |6.15 ), ( |6.16| ) for ft > and in Eqs. (|6.19 ), 
(|O0| ) for ft < 0. 

With some vector algebra, Eqs. ( |D.27| ) and ( p.28| ) can be converted into expressions for 
V^CTjV and UajW (j = 1, 2, 3), which used in Eqs. <$2~M) and (ETMl) yield 



u 2 = R x /R x and u 3 



v 2 = R x /R, 
1 



r 



v 3 = R-/R- , 
(R+ xR x )+R x (pjR.x + (3 2 R 2 ) 



(D.30) 
(D.31) 



with Vi = v 2 x i> 3 and U\ = u 2 x u 3 , as explained in Sec. |2.2.3.6| . In Eq. ( |D.31| ), T is a 
normalization factor given in Eq. ( 15.14 ), and repeated below 



n^2 



R\ + [\\(3 1 R 1 + p 2 R 2 \\ 2 + 2a (ft - (3 2 )] r\ . 



a- 



(D.32) 



In Eq. ( |6.18|) [resp. Eq. (|6.24j) 1 a more explicitly formula for T was given in the case ft > 
[resp. ft < 0]. 



Recall, however, that the optimal unitary U is not of the form given in Eq. (6.12) when Q < and 
R x =0. As a consequence, the vector Uk derived here is not the optimal one for this particular case. 
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List of Symbols 



The following list is neither exhaustive nor exclusive, but may be helpful. 

Id The identity matrix of dimension d; 

X, Y, Z . . . . The Pauli matrices; 

0i, (72, 03 . . . . Alternative notation for the Pauli matrices X, Y, and Z, respectively. 

Aid The algebra of complex matrices of dimension d; 

M T The transpose of the matrix M; 

M Tk The partial transpose of the matrix M with respect to the k— th subsys- 
tem; 

Tr M The trace of the matrix M; 

Tr^M The partial trace of the matrix M with respect to the k— th subsystem; 

X d The identity map on Aid] 

C An arbitrary linear map from Aid to Aid', 

£. The Choi matrix of the map C; 

/Q et The set of completely positive maps from Aid to Aid] 

^ The Choi matrix of a map /C G /Q et ; 

<2 d et The subset of trace preserving maps of /C d et ; 

H The Choi matrix of a map Q G Q s d ct ; 

i3 d ct The subset Q s d ct with elements of positive partial transposed Choi matrix; 

i3 d ct The set of entanglement breaking and trace preserving maps from A^d 

to Aid] 

23 The Choi matrix of a map B G B d ct ] 

H d Hilbert space of dimension d; 
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{l^d)}a=i • • • An orthonormal bases of H d ; 

The unnormalized maximally entangled state Yla=i l^d) ® l^d) ^ Hd ® 

Hd; 

{-^d }q=i • • • • An orthonormal bases for the Hermitian matrices of dimension d [gener- 
ators of SU(d)]; 

J The Uhlmann-Jozsa fidelity; 

A, B, C[3] . . Three metrics related to 5F; 

S^n An alternative fidelity measure between mixed states; 

C[5Fjv] A metric related to 9V j 

Q The non-logarithmic variety of the quantum Chernoff bound; 

||M||tr The trace norm of a matrix M; 

||M||hs The Hilbert-Schmidt (or Frobenius) norm of a matrix M; 

||M|| The spectral (or operator) norm of a matrix M; 

T> The trace distance; 

"K The metric induced by the Hilbert-Schmidt norm; 

The metric induced by the Spectral norm; 

Ql An arbitrary function measuring the distance between density matrices; 

[pi] i=1 A sequence of I density matrices; 

(^) 12 Two possible averaging schemes for quantifying the distance *2i between 

sequences of density matrices; 
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